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Elementary Statistics: Introduction 


The following modules are based on the award-winning Elementary 
Statistics online course by authors Barbara Illowsky and Susan Dean. The 
content presented here was designed to be used as a complementary 
resource with their Collaborative Statistics textbook/collection. 


Note:The source documents for this collection can be found at 
http://sofia. fhda.edu/gallery/statistics/index.html. 


Course Management 


The Course Syllabus provides instructors a basic framework for teaching 
this material to their students. Thi document is intended to serve as a 
starting point; instructors should use this document as a foundation for 
creating a learning experience customized to meet their students unique 
needs. 


Video Lectures 


As a part of their award-winning online course, the authors have provided a 
number of video lectures. These half-hour segments can be used for self- 
study or as a complement to the Collaborative Statistics textbook. The 
authors also provide videos instructing students on the use of the TI-83 
calculator as used in the textbook and course activities and exercises. 
Lecture Videos 


e Chapter 1: Sampling and Data 

e Chapter 2: Descriptive Statistics 

e Chapter 3: Probability Topics 

e Chapter 4: Discrete Distributions 

e Chapter 5: Continuous Random Variables 
e Chapter 6: The Normal Distribution 


e Chapter 7: The Central Limit Theorem 
e Chapter 8: Confidence Intervals 
e Chapter 9: Hypothesis Testing - Single Mean and Single Proportion 


Data 
e Chapter 11: The Chi-Square Distribution 
e Chapter 12: Linear Regression and Correlation 


TI-83 Calculator Video Tutorials 


e TI-83 Calculator Tutorial, Part 1 
e TI-83 Calculator Tutorial, Part 2 


Practice Exams, Problem Sets, and Quizzes 


A number of practice tests and problem sets are provided for student self- 
evaluation and to provide opportunities for students to practice key 
concepts introduced throughout the course. Solutions to these exercises are 
provided as feedback to aid student retention and understanding. These 
problem sets may be used as homework assignments or self-directed study 
aids. 

Skills Practice Exams 


e Skills Practice Exam 1: Chapters 1,2, & 12 
4 


e Skills Practice Exam 3: Chapters 7, 8,9, & 10 
Practice Final Exams 


e Practice Final Exam 1: Chapters 1 & 2 

e Practice Final Exam 2: Chapters 3 & 4 

e Practice Final Exam 3: Chapters 5,6, & 7 
e Practice Final Exam 4: Chapters 8,9, & 10 
e Practice Final Exam 5: Chapter 11 

e Practice Final Exam 6: Chapter 12 


In addition to the problem sets provided above, the following multiple- 
choice quizzes are provided as resources for instructors. These modules can 


be used as assignments or as templates for classroom assessments. Answers 
to these items are not provided. 
Quizzes 


e Chapter 1: Sampling and Data 

e Chapter 2: Descriptive Statistics 

e Chapter 3: Probability Topics 

e Chapter 4: Discrete Distributions 

e Chapter 5: Continuous Random Variables 

e Chapter 6: The Normal Distribution 

e Chapter 7: The Central Limit Theorem 

e Chapter 8: Confidence Intervals 

e Chapter 9: Hypothesis Testing - Single Mean and Single Proportion 

e Chapter 10: Hypothesis Testing - Two Means, Two Proportions, Paired 
Data 

e Chapter 11: The Chi-Square Distribution 

e Chapter 12: Linear Regression and Correlation 


Calculator Instructions 


The following module contains a number of resources related to the TI-83 
calculator and ways it can be used with the Collaborative Statistics textbook 
and curriculum. This resource addresses many different function on the 
calculator, including calculation of the outliers, discrete mean, standard 
deviation, and random numbers. 


e View the TI-83 Calculator Resources 


Elementary Statistics: Course Syllabus 


Course Description 


Elementary Statistics is an introduction to data analysis course that makes 
use of graphical and numerical techniques to study patterns and departures 
from patterns. The student studies randomness with emphasis on 
understanding variation, collects information in the face of uncertainty, 
checks distributional assumptions, tests hypotheses, uses probability as a 
tool for anticipating what the distribution of data may look like under a set 
of assumptions, and uses appropriate statistical models to draw conclusions 
from data. 


The course introduces the student to applications in engineering, business, 
economics, medicine, education, the sciences, and other related fields. The 
use of technology (computers or graphing calculators) will be required in 
certain applications. 


Texts, Materials, and Plug-ins 


Texts 


These course materials are designed for use with Collaborative Statistics by 
Barbara Illowsky and Susan Dean. This text may be used online or can be 
downloaded in PDF format at no cost through the Connexions website, or 
you may choose to purchase a low-cost printed copy using the "Order 
Printed Copy" link provided on the collection home page linked above. 


Instructors wishing to customize this textbook can do so by creating a 
Connexions account. Connexions accounts are free and allow users to rip, 
mix, and burn content by updating modules and creating custom collections 
of educational content. Please see the Connexions website to learn more 
about Connexions and how you can use it to customize your students' 
learning experience at absolutely no cost. 


Materials 


Required Calculator: The T1-83 calculator is required. There are many 
examples that use the TI-83 calculator and contain the calculator 
instructions. YOU WILL BE TAUGHT HOW TO USE THE 
CALCULATOR IN THE COURSE LESSONS. 


Throughout the course, you will be given instructions for the TI-83 
Calculator. Labs and projects make use of the TI graphing calculator and 
may be done individually or in groups of up to four. 


Please download the T1I-83 calculator guidebook from this TI-83 Site. 
Follow the links for "TI-83 Plus Silver Edition" OR "TI-83 Plus" and use 
the "Guidebooks" link. 


You can review a number of TI-83 resources at 
http://cnx.org/content/m17581/latest/ or review the video tutorials online 
(Part 1, Part 2). 


Other Calculators: TI-86 or TI-89 


You may use the TI-86 or TI-89 calculator if you have one, but you must 
have the programs loaded into it from the following TI-86 or TI-89 Web 


pages. 
Download the TI-86 Calculator Instructions and follow the following links: 


e TI-86 Advanced Statistics Program 

e Advanced Statistics Guide infstat1.86g 

¢ Collaborative Statistics: Workbook has many TI-86 calculator 
instructions in it. 


Download the TI-89 Calculator Instructions and follow these links: 


e Statistics with List Editor. 
e Find "Get the Statistics with List Editor App!" and "Statistics with List 
Editor Guidebook." 


Homework and Suggested Grading 


The purpose of homework is to help you learn the material in the course. 
You learn the most and do your best if you do the homework problems. You 
are expected to do the chapter PRACTICE in the workbook before 
attempting the homework. The answers to the Practice are in the back of the 
workbook. Then do the assigned odd numbered homework problems in the 
text and check those answers in the back of the text. 


Lowest 

Total Points (out 
Homework Points of 700) for: Percentage 
Frans (oe 0 300 A: 630 90-100% 
points each) 
Quizzes (12 @ 10 
points each, 3 lowest 90 B: 546 78-89% 
dropped) 
Babs (22730 pouits 60 C: 462 66-77% 
each) 
Ee ee 150 D: 385 55-65% 
points each) 
Final Exam 100 Pow 0-54% 


385 


Video Lecture 1: Sampling and Data 

This video lecture is designed as a supplement to the Sampling and Data 
chapter of Collaborative Statistics (col10522) by Barbara Illowsky and 
Susan Dean. In this chapter, students can expect to learn how to recognize 
and differentiate between key terms in statistics, apply various types of 
sampling methods to data collections, and create and interpret frequency 
tables. 

https://www. youtube.com/embed/vixNnZO8X9k 


Video Lecture 2: Descriptive Statistics 

This video lecture is designed as a supplement to the Descriptive Statistics 
chapter of Collaborative Statistics (col10522) by Barbara Illowsky and 
Susan Dean. In this chapter, students can expect to learn how to display data 
graphically and interpret graphs; recognize, describe and calculate the 
measures of location of data such as quartiles and percentiles; recognize, 
describe, and calculate the measures of the center of data; and recognize, 
describe, and calculate the measures of the spread of data. 
https://www.youtube.com/embed/X_VJfdMTpLlI 


Video Lecture 3: Probability Topics 

This video lecture is designed as a supplement to the Probability Topics 
chapter of Collaborative Statistics (col10522) by Barbara Illowsky and 
Susan Dean. In this chapter, students can expect to learn how to understand 
and use the terminology of probability; determine whether two events are 
mutually exclusive or independent; calculate probabilities using the 
Addition Rules and Multiplication Rules; and construct and interpret 
Contingency Tables, Venn Diagrams, and Tree Diagrams. 


Video Lecture 4: Discrete Distributions 

This video lecture is designed as a supplement to the Discrete Distributions 
of Collaborative Statistics (col10522) by Barbara Illowsky and Susan Dean. 
In this chapter, students can expect to learn how to recognize and 
understand discrete probability distribution functions; calculate and 
interpret expected values; recognize and apply the binomial probability 
distribution, the Poisson probability distribution, the geometric probability 
distribution, and the hypergeometric probability distribution; and classify 
discrete word problems by their distributions. 

https://www. youtube.com/embed/vy_eY1H_ DOU 


Video Lecture 5: Continuous Random Variables 

This video lecture is designed as a supplement to the Continuous Random 
Variables chapter of Collaborative Statistics (col10522) by Barbara 
Illowsky and Susan Dean. In this chapter, students can expect to learn how 
to recognize and understand continuous probability distribution functions in 
general; recognize the uniform probability distribution and apply it 
appropriately; and recognize the exponential probability distribution and 
apply it appropriately. 

https://www. youtube.com/embed/R944A GIpPog 


Video Lecture 6: The Normal Distribution 

This video lecture is designed as a supplement to the Normal Distribution 
chapter of Collaborative Statistics (col10522) by Barbara Illowsky and 
Susan Dean. In this chapter, students can expect to learn how to recognize 
the normal probability distribution and the standard normal probability 
distribution and apply each appropriately, and compare normal probabilities 
by converting to the standard normal distribution. 


Video Lecture 7: The Central Limit Theorem 

This video lecture is designed as a supplement to the Central Limit 
Theorem chapter of Collaborative Statistics (col10522) by Barbara Illowsky 
and Susan Dean. In this chapter, students can expect to learn how to 
recognize the Central Limit Theorem problems; classify continuous word 
problems by their distributions, and apply and interpret the Central Limit 
Theorem for Averages and the Central Limit Theorem for Sums. 


Video Lecture 8: Confidence Intervals 

This video lecture is designed as a supplement to the Confidence Intervals 
chapter of Collaborative Statistics (col10522) by Barbara Illowsky and 
Susan Dean. In this chapter, students can expect to learn how to calculate 
and interpret confidence intervals for one populations average and one 
population proportion; interpret the student-t probability distribution as the 
sample size changes; and discriminate between problems applying the 
normal and then student-t distributions. 


Video Lecture 9: Hypothesis Testing with a Single Mean 

This video lecture is designed as a supplement to the Hypothesis Testing: 
Single Mean and Single Proportion chapter of Collaborative Statistics 
(col10522) by Barbara Illowsky and Susan Dean. In this chapter, students 
can expect to learn how to differentiate between Type I and Type II Errors; 
describe hypothesis testing in general and in practice; and conduct and 
interpret hypothesis tests for single population means, known and unknown 
population standard deviation, and also proportions. 


Note: A captioned version of this video is currently unavailable. This will 
be updated shortly. 
https://www. youtube.com/embed/gpLOmMAZ _JfE 


Video Lecture 10: Hypothesis Testing with Two Means 

This video lecture is designed as a supplement to the Hypothesis Testing: 
Two Means, Two Populations, Paired Data chapter of Collaborative 
Statistics (col10522) by Barbara Illowsky and Susan Dean. In this chapter, 
students can expect to learn how to classify hypothesis tests by type; 
conduct and interpret hypothesis tests for two population means with 
standard deviations that are both known and unknown; conduct and 
interpret hypothesis tests for two population proportions; and conduct and 
interpret hypothesis tests for matched or paired samples. 

https://www. youtube.com/embed/KSgDoWWZevU 


Video Lecture 11: The Chi-Square Distribution 

This video lecture is designed as a supplement to the Chi-Square 
Distribution chapter of Collaborative Statistics (col10522) by Barbara 
Illowsky and Susan Dean. In this chapter, students can expect to learn how 
to interpret the chi-square probability distribution as the sample size 
changes; conduct and interpret chi-square goodness-of-fit hypothesis tests; 
conduct and intepret chi-square test of independence tests; and conduct and 
interpret chi-square single variance hypothesis tests. 

https://www. youtube.com/embed/S5jbIw0SI2s 


Video Lecture 12: Linear Regression and Correlation 

This video lecture is designed as a supplement to the Linear Regression and 
Correlation chapter of Collaborative Statistics (col10522) by Barbara 
Illowsky and Susan Dean. In this chapter, students can expect to learn how 
to discuss basic ideas of linear regression and correlation; create and 
interpret a line of best fit; and calculate and interpret the correlation 
coefficient and outliers. 

https://www.youtube.com/embed/hbKcpJaPknl 


Elementary Statistics: Skills Practice Exam 1- Lessons 1, 2, 12 


Questions 1 — 4 use the following table and shows the lengths (in minutes) of 64 
international phone calls using a prepaid calling card. 


Length of call Relative Cum. Relative 
(minutes) Frequency frequency Freq. 
4 25 0.3906 
14 15 
24 10 0.1563 
34 9 0.1406 
44 4 0.0625 
54 1 0.0156 1.00 
Exercise: 


Problem: The histogram of this data looks to be: 


¢ ASkewed right 
¢ BSkewed left 
e¢ CSymmetrical 


Solution: 


A 


Exercise: 


Problem: Which of the following box plots most accurately displays the data? 


« D 


a 


Solution: 


B 


Exercise: 


Problem: What percent of telephone calls were more than 24 minutes? 


e A15.63% 
¢ B21.88% 
¢ C62.5% 

¢ D78.13% 


Solution: 
B 
Exercise: 
Problem: Find the 80th percentile. 


e Al4 


e B24 
e C34 
e D70 


Solution: 


c 
Exercise: 
Problem: 


What can be said about a set of data when its standard deviation is small (but not 
zero)? 


e AThe data are far apart. 

e¢ BAI! of the data have the same value. 

e CThe mean of the data can never be zero. 
¢ DtThe data are close together. 


Solution: 


D 


Questions 6 and 7, refer to the following: A sample of students was taken to 
determine pulse rate. The data is shown: 


Pulse rate- 
beats per 54 58 65 68 72 76 80 90 98 
minute 


Frequency 


(# of 1 3 6 8 5 3 8 4 2 
Students) 


Exercise: 


Problem: Find the median and mode(s). 


A72, 68 and 80 
B72 and 80 
C76 and 68 
D76, 68 and 80 


Solution: 


A 
Exercise: 


Problem: 


Out of the entire college population of 24,000 students, approximately what 
percent of students are expected to have a pulse rate of 65? 


e AG 
e B40 
e C9 
e D15 


Solution: 


D 


For questions 8 and 9, refer to the following: By determining the average number 
of people in a car using the "Diamond" Carpool Lane, the California Highway Patrol 
is trying to decide if the number of people in a car using the "Diamond" Carpool Lane 
should be increased from 2 to 3. 

Exercise: 


Problem: 


The average number of people per car for all cars using the "Diamond" Carpool 
Lane is called the: 


e Aparameter 
e Bdata 
e Cvariable 


Dstatistic 


Solution: 


D 


Exercise: 


Problem: The number of people in 1 car is called the: 


e Aparameter 
e Bdata 
e Cvariable 
e Dstatistic 
Solution: 
6 
Exercise: 
Problem: 


I toss a fair coin a large number of times. Assuming the tosses are independent, 
which of the following is true? 


AOnce the number of flips is large enough, the number of heads will always 
be exactly half of the total number of tosses. For example, after 10,000 
tosses I should have exactly 5,000 heads. 

BThe proportion of heads will be about 1/2 and this proportion will tend to 
get closer to 1/2 as the number of tosses increases. 

CAs the number of tosses increases, any long run of heads will be balanced 
by a corresponding run of tails so that the overall proportion of heads is 
exactly 1/2. 

DAII of the above. 


Solution: 


B 


Questions 11 - 13, refer to the following: A sample of twenty people went on a 
cruise to Alaska. Their two-week weight gain is shown below (a weight loss is shown 
by a negative number.) 


Weight Gain Frequency 
a2 4 
0 5 
2 8 
5 2 
9 1 
Exercise: 
Problem: The middle 50% of the data is between and 
e AO and2 
e BO and9 
e C2 and9 
e D2 and2 
Solution: 
A 
Exercise: 


Problem: Find the average weight gain (in pounds). 


e Al.35 
e B2.74 
e C2 


¢ DThere is not enough information. 


Solution: 


A 
Exercise: 
Problem: 
What weight gain is 3 standard deviations above the mean (in pounds)? 
e A4.05 
¢ B8.19 


e €9.57 
¢ DThere is not enough information 


Solution: 


A 


For questions 14 - 19, use the following information: Kim, a personal trainer, was 
interested in whether or not there was a linear relationship between the number of 
visits her clients made to the gym each week and the average amount of time her 
clients exercised per visit. She took the following data. 


Client 1 2 3 4 5 6 
Number of visits per week 1 3 4 2 3 5 


Average time spent exercising per 
visit (hours) 


Exercise: 


Problem: The line that best fits the data is: 


© AY = —0.44 + 2.622 
¢ BY = 0.44 + 2.622 
© Cy = 2.62 + 0.442 
¢ Dy = 2.62 — 0.442 


Solution: 


A 


Exercise: 


Problem: Is the correlation coefficient significant? 


e AYes 

e BNo 

e Clit might be. 

e DNot enough information is given. 


Solution: 


A 
Exercise: 


Problem: 


Using the best fit line, estimate the average time spent exercising per visit for 4 
visits per week. 


e A? hours 

e BO.86 hours 
e Cl hour 

e D10.04 hours 


Solution: 


B 


Exercise: 


Problem: 


Kim used the best fit line to estimate the average time spent exercising per visit 
for her client Toby who visited the gym 7 times per week. Does the least squares 
line give an accurate estimate? 


e AYes 

¢ BNo 

e CMaybe 

¢ DNot enough information is given. 


Solution: 


B 


Exercise: 


Problem: If the correlation coefficient is —1, which answer is correct? 


e AThe slope of the best fit line is positive. 

¢ BThe slope of the best fit line is —1. 

e CThe data fit exactly on a line with positive slope. 
e DtThe data fit exactly on a line with negative slope. 


Solution: 


D 


Exercise: 


Problem: A scatter plot shows: 


e Athe direction and strength of a relationship between the independent and 
dependent variables. 

e Bthat there is a linear relationship between the independent and dependent 
variables. 

¢ Chow you can predict the dependent variable knowing the independent 

variable. 

Dnothing. The line of best fit is what is important. 


Solution: 


A 
Exercise: 
Problem: 
Suppose we are interested in the average grade on the first math 10 test taken by 
all students at De Anza during the spring 2002 quarter. We randomly choose 3 


students from each of the spring 2002 Math 10 classes as our sample. This 
sampling technique is 


e Asystematic 
e Bcluster 

¢ Cstratified 

e Dconvenience 


Solution: 


B 
Exercise: 
Problem: 
Tracy works at a local indoor soccer arena. He is interested in the proportion of 


people entering the area who spend money in the arena store. One night while he 
is working, Tracy counts the first 20 people who buy goods in the arena store. 


e Asystematic 
e Bcluster 

¢ Cstratified 

e Dconvenience 


Solution: 


D 


Skills Practice Exam 2: Chapters 3, 4, 5, & 6 


Questions 1 — 2 refer to the following: Below is the probability distribution function for the number of high 
school years that students at a local high school play on a sports team. 


x P(X =2) 

0 0.32 

1 0.12 

2 

3 0.18 

4 0.14 
Exercise: 


Problem: What is the probability that X = 2? 


e A0.24 
e BO.76 
e C0.32 
e DCannot determine 


Solution: 


A 
Exercise: 
Problem: 


Over the long run, the average number of years that we would expect students at this high school to play ona 
sports team is: 


e AO 
e B1.7 
e C2 
e D2.6 


Solution: 


B 
Exercise: 
Problem: 


According to the 2000 United States Census, 12.3% of the population is Black or African American. The 
probability that a randomly selected U. S. resident is NOT Black or African American is: 


e A0.123 
e BO.877 
e C0.754 
DCannot determine 


Solution: 


B 
Exercise: 
Problem: 
Assume the statistics final is a multiple choice test with 40 questions. Each question has four choices with one 


correct answer per question. If you were to randomly guess on each of the questions, what is the probability 
of getting exactly the expected number of correct answers? 


e A0.5839 
e BO.5605 
e €0.25 

e DO.1444 


Solution: 


D 


Exercise: 


Problem: In an exponential distribution, the mean is larger than the median. 


e Atrue 
e Bfalse 


Solution: 


A 
Exercise: 
Problem: 
In Fall 1999, students in one Math 10 section determined that the length of movies at the cinema was 


normally distributed with a mean of 148 minutes and a standard deviation of 19 minutes. Find the third 
quartile and interpret it. 


e A75 minutes; Three-fourths of the movie lengths fall below 75 minutes. 

¢ B160.8 minutes; Three-fourths of the movie lengths fall below 160.8 minutes. 
e C160.8; Three-fourths of the movies last 160.8 minutes. 

e D75 minutes; Three-fourths of the movies last 75 minutes. 


Solution: 


B 


Exercise: 


Problem: Which of the following is FALSE about data that follows the normal distribution? 


e AThe mean is the same as the mode. 

e BtThe standard deviation is the same as the mean. 

e CThe median is the same as the mode. 

e DMost data is within 3 standard deviations of the median. 


Solution: 


B 
Exercise: 
Problem: 


The graph showing the age of getting a driver’s license in California starts and peaks at age 16, and decreases 
from there. This shape most closely resembles what type of distribution? 


e a Normal 

e b Binomial 

e c Uniform 

e d Exponential 


Solution: 


D 


Use the following information for questions 9 and 10. The amount of time that a randomly chosen 6th grade 
student spends on homework per week is uniformly distributed from 30 to 120 minutes. 
Exercise: 


Problem: 


What is the probability that a randomly chosen 6th grade student spends at least 60 minutes per week on 
homework knowing that he/she will spend at most 80 minutes per week on homework? 


e al.20 

e b 0.6667 
e ¢ 0.2222 
e d0.4 


Solution: 


D 
Exercise: 
Problem: 


What is the expected amount of time that a randomly chosen 6th grade student spends on homework per 
week? 


e a 45 minutes 
e b 60 minutes 
¢ c 30 minutes 
e d75 minutes 


Solution: 


D 


Use the following information for questions 11 and 12. The length of time a randomly chosen 9-year old child 
spends playing video games per day is approximately exponentially distributed with a mean equal to 2 hours. 
Exercise: 


Problem: Find the probability that a randomly chosen 9-year old will play video games at most 3 hours. 


e a 0.7769 
¢ b0.9975 
e c 0.0025 
e d0.2231 


Solution: 


A 


Exercise: 


Problem: 70% of 9-year old children will play video games per day for at most how long? 


e a 0.60 hours 
e b 2.41 hours 
e c 0.71 hours 
e dCannot determine 


Solution: 


B 


Use the following information for questions 13 and 14. Research has shown that studying improves a student’s 
chances to 80% of selecting the correct answer to a multiple choice question. A multiple choice test has 15 
questions. Each question has 4 choices. 

Exercise: 


Problem: What is the distribution for the number of questions answered correctly when a student studies? 


a B(15, 0.80) 
b B(15, 0.25) 
c P(15) 

d P(6) 


Solution: 


A 


Exercise: 


Problem: 


Suppose that a student does not study for the test but randomly guesses the answers. What is the probability 
that the student will answer 7 or 8 questions correctly? 


e a0.2951 
e b 0.0524 
e c 0.0131 
e dCannot determine 


Solution: 


B 
Exercise: 


Problem: 


A downtown hotel determined the probability of finding X taxicabs waiting outside the hotel anytime 
between 5 PM and midnight. The information is shown in the table. 


be P(X) 
1 0.0667 
2 0.1331 
3 0.2000 
4 0.2667 
5 0.3333 


What is the average number of taxicabs that are expected to be waiting outside the hotel anytime between 5 
PM and midnight? 


e a3.7 
e b3 
e c0 
e d15 


Solution: 


A 


Exercise: 


Problem: 


. During the registration period for a new quarter, the De Anza College Registrar’s Office processes 
approximately 75 applications per hour, on the average. What is the probability that it will process more than 
80 applications for a randomly chosen hour? (This is a Poisson problem. If you did not cover the Poisson 
Distribution, then skip this problem.) 


e a 0.0379 
¢ b 0.2589 
e c 0.7411 
e d0.0248 


Solution: 


B 


Questions 17 - 19 refer to the following: P(T) = 0.69 P(S) = 0.5, P(S|T) = 0.5 
Exercise: 


Problem: Events S and T are: 


¢ a mutually exclusive 

e b independent 

¢ c mutually exclusive and independent 

e dneither mutually exclusive nor independent 


Solution: 


B 


Exercise: 


Problem: Find P(S AND T) 


e¢ a 0.3450 
¢ b 0.2500 
e c 0.6900 
e dl 


Solution: 
A 
Exercise: 


Problem: Find P(S OR T) 


e a 0.6900 
e b1.19 

e c 0.8450 
e dO 


Solution: 


@ 
Exercise: 
Problem: 


Based on data from the US Census Bureau the average age of US residents is 36.31 with a standard deviation 
of 21.99. The data is normally distributed. The notation for the distribution is: 


° aX ~N(36.31, 21.99) 
° bX~N(21.99, 36.31) 
* cX ~ B(36.31, 22) 

° dX ~U(0, 36.31) 


Solution: 


A 


Exercise: 


Problem: In a binomial distribution we: 


e acount the number of successes until a failure is obtained 

e b count the number of trials until a success is obtained 

e c count the number of successes in a finite number of trials 

e dcount the number of trials until the number of successes equals the number of failures 


Solution: 


C 
Exercise: 
Problem: 


Certain stocks have a probability of 0.6 of returning a $100 profit. They also have a probability of 0.4 of 
having a loss of $300. Over the long run, what is the best thing to do to maximize your profit, and why? 


¢ a Invest in the stocks because there is a greater probability of making money than losing money. 


¢ b Do not invest in the stocks because the dollar amount for each loss is greater than the dollar amount for 
each gain. 


¢ c Invest in the stocks because making $100 per stock is preferred to losing $300 per stock. 
e d Do not invest in the stocks because the expected value is a loss. 


Solution: 


D 


Questions 23 - 27 refer to the following table (data from the Institutional Research department of the 
Foothill-De Anza Community College District for De Anza College). 


Black Hispanic White Undeclared Total 


American Asian/Pacific 


Indian Islander 
Administrator 0 3 5 5 
Staff a 35 21 30 
Faculty 3 58 14 45 
Total 4 96 40 80 


Suppose that one De Anza College employee is randomly selected. 
Exercise: 


Problem: Find P (the employee is an Administrator) 


e a 278/34 
e b 304/616 
e c34/616 
e d80/616 


Solution: 


C 


Exercise: 


Problem: Find P (the employee is Faculty AND American Indian) 


e a 382/616 
e b 3/616 

© c3/4 

e d3/278 


Solution: 


B 


Exercise: 


Problem: Find P (employee is Staff OR Hispanic) 


e a 384/616 
¢ b 80/616 

e c 304/616 
e d354/616 


Solution: 


D 


Exercise: 


21 


201 


141 


363 


Problem: Find P (employee is an Administrator GIVEN the employee is Black) 


17 


33 


34 


304 


278 


616 


a 40/616 
e b5/34 
e c 5/616 
e d5/40 


Solution: 
D 
Exercise: 
Problem: Being an Administrator and an American Indian are 


¢ a mutually exclusive events 

e b independent events 

¢ c mutually exclusive and independent events 

e dneither mutually exclusive nor independent events 


Solution: 
A 
Exercise: 
Problem: P(X$gt; 4 or X < 2) is best described by which of the following graphs? 


ea 


-10-8-6-4-2 02 4 6 8 10 


“10-8 -6-4-2 02 4 6 6 10 


-10-8-6-4-2 02 4 6 8 10 


“10-8 -6-4-2 02 4 6 8 10 


Solution: 

B 
Questions 29 - 32 refer to the following: When a customer calls the "Help Line" at ABC Computer Software Co., 
the amount of time that a customer must wait "on hold" until somebody answers the line and helps the customer 


follows an exponential distribution with mean of 7.5 minutes. 
Exercise: 


Problem: What is the probability that a customer waits more than 10 minutes to receive help? 


e¢ a 0.2636 
¢ b0.75 

e c 0.7364 
e do 


Solution: 


A 


Exercise: 


Problem: What is the 40th percentile of wait times for customers calling the help line? 


e a 6.87 minutes 
e b 3.83 minutes 
e c 0.68 minutes 
e d 0.122 minutes 


Solution: 
B 
Exercise: 

Problem: The customer wait time that is 1 standard deviation above the mean is: 
e a 2.17 minutes 
e b 7.5 minutes 
e c 9.67 minutes 
e d 15 minutes 

Solution: 


D 


Exercise: 


Problem: The probability that a customer calling the help line waits exactly 6 minutes for help: 


e al 

e b0.45 
e ¢€ 0.55 
e d0.8 


Solution: 


A 


Questions 33 — 35 refer to the following: ABC Delivery Service offers next day delivery of packages weighing 
between 2 and 20 pounds in a certain city. The have found that the weights of the packages they deliver are 
uniformly distributed between 2 and 20 pounds. 

Exercise: 


Problem: What is the probability that a package weighs between 10 and 15 pounds? 


e a0.2778 
e¢ b 0.5556 
e ¢ 0.2500 
e d0.8333 


Solution: 


A 
Exercise: 


Problem: 
Given that a package weighs less than 10 pounds, what is the probability that it weighs less than 5 pounds? 


¢ a 0.1667 
e b 0.6250 
e ¢ 0.3750 
e d0.5000 


Solution: 


C 
Exercise: 
Problem: 35% of packages weigh less than how many pounds? 
e a 7.8 pounds 
e b 8.3 pounds 


e c 11.7 pounds 
e d13.7 pounds 


Solution: 


B 
Exercise: 


Problem: 


Suppose that the probability that an adult in California will watch a Giant’s World Series game is 65%. Each 
person is considered independent. Of interest, is the number of adults in California we must survey until we 
find one who will watch a Giant’s World Series game. What is the probability that you must ask 2 or 3 
people? (This is a geometric problem. If you did not cover the geometric distribution, then skip this problem.) 


e a 0.6500 
¢ b 0.3071 
e € 0.2275 
e d0.0796 


Solution: 


B 


Questions 37— 39 refer to the following: The amount of time De Anza students work per week is approximately 
normally distributed with mean of 18.17 hours and a standard deviation of 12.92 hours. 
Exercise: 


Problem: The median is: 


e a Not enough information 
e b 12.92 

¢ c2.0 

e d 18.17 


Solution: 


D 


Exercise: 


Problem: The 90th percentile for the amount of time De Anza students work per week is: 


e al.6l 
e b 18.17 
e c90 

e d 34.7 


Solution: 
D 
Exercise: 
Problem: Which of the following is NOT TRUE about the normal distribution? 
e athe mean, median and mode are equal 


e b the curve is skewed to the right 
e c the curve never touches the x-axis 


e d the area under the curve is one. 


Solution: 


B 


Exercise: 


Problem: We use the z-score to: 


¢ a compare normal distributions with different averages and standard deviations 

¢ b drive statistics students nuts 

¢ c compare exponential distributions with the same average 

¢ dcompare uniform distributions with different minimum and maximum numbers 


Solution: 


A 


Elementary Statistics: Skills Practice Exam 3- Lessons 7,8, 9, 10 


Questions 1, 2, and 3 refer to the following: 

In a study of a sample of 35 Computer Science majors and 40 Business 
majors, the average number of science fiction books each group read per 
year was recorded. The Computer Science majors read an average of 11 
books with a standard deviation of 4 books. The Business majors read an 
average of 9 books with a standard deviation of 4.5 books. We are 
interested in whether the average number of science fiction books read by 
the Computer Science majors is the same as the number read by the 
Business majors. CS = Computer Science majors B = Business majors 
Exercise: 


Problem: The alternate hypothesis is: 


e amCS =mB 
e bmCS ! mB 

e cmCS < mB 
e dmCS > mB 


Solution: 


B 


Exercise: 
Problem: The exact distribution for the test is: 
e aNormal 
e bStudent-t with df » 72.9775 
e cExponential 
e dUniform 


Solution: 


C 


Exercise: 


Problem: The p-value is 


e a 0.0452 
¢ b 2.0376 
e € 0.0226 
e dO 


Solution: 


A 


Questions 4 and 5 refer to the following: 

Suppose that a random survey of 10 teenagers found that the average 
amount of time they spend on the Internet each day is 3.2 hours with a 
sample standard deviation of 0.78 hours. 

Exercise: 


Problem: 


The point estimate of the population average amount of time teenagers 
spend on the Internet each day is 


e a0.78 hours 
e b 32 hours 
e c 3.2 hours 
e dunknown 


Solution: 


C 


Exercise: 


Problem: 


An 87% confidence interval for the average amount of time a teenager 
spends on the Internet each day would be 


° a(2.71, 3.68) 
° b (2.83, 3.57) 
© ¢(2.64, 3.76) 
 d(2.79, 3.61) 


Solution: 


D 
Exercise: 


Problem: 


Which of the following is TRUE about a 99% confidence interval for 
the true average amount of time that teenagers spend on the Internet 
each day? 


e TWe are 99% confident that the true average time that teenagers 
spend on the Internet per day lies within the confidence interval. 

e IIThe confidence interval contains 99% of the data values 
collected. 

e II199% of all the confidence intervals constructed this way 
contain the true average amount of time teenagers spend on the 
Internet each day. 


e al, II, and III 
e bl and II 

e cl and III 

e dl only 


Solution: 


C 


Questions 7 and 8 refer to the following: 

Suppose a random survey of 600 registered voters revealed that only 39% 
actually voted in the last primary election. We are interested in the 
population proportion of registered voters who actually voted in the last 
primary election. 

Exercise: 


Problem: 


The margin of error (error bound) for a 92% confidence interval for the 
true proportion of registered voters who actually voted in the last 
primary election is 


e a0.0176 
e¢ b0.3900 
e ¢€0.3551 
e d0.0349 


Solution: 


D 
Exercise: 


Problem: 


Suppose you want to redo the survey of registered voters to see how 
many voted in the last election but this time you survey 1000 
registered voters and find 340 voted in the last election. If the 
confidence level is maintained at 92% what happens to the confidence 
interval. 


e aThe confidence interval gets wider. 

e bThe confidence interval gets narrower. 

e cThe confidence interval stays the same. 

e dThe confidence interval cannot be calculated. 


Solution: 


B 
Exercise: 
Problem: 
A study of a certain brand of AA batteries yielded a sample mean 
lifetime of 450 minutes with a sample standard deviation of 92 


minutes. A hypothesis test was performed using the following 
hypotheses: 


Hf, :m = 480 
H,:m < 480 
The type I error for this hypothesis test is: 


e ato conclude that the average battery lifetime is less than 480 
minutes when, in reality, it is equal to 480 minutes 

e bto conclude that the average battery lifetime is equal to 480 
minutes when, in reality, it actually is equal to 480 minutes 

e cto conclude that the average battery lifetime is equal to 480 
minutes when, in reality, it is less than 480 minutes 

e dto conclude that the average battery lifetime is greater than 480 
minutes when, in reality, it is equal to 480 minutes 


Solution: 


A 


Exercise: 


Problem: 


The null hypothesis is “the percentage of men who score 45 points or 
more (out of 50 points) on a statistics test at De Anza College is the 
same as the percentage of women who score 45 points or more.” The 
alternate hypothesis is “the percentage of men who score 45 points or 
more (out of a 50 point test) on a statistics test at De Anza College is 
more than the percentage of women who score 45 points or more.” 


The appropriate hypothesis test to perform and distribution to use is 


¢ aa test of two population proportions, independent groups; 
Student-t distribution 

e ba test of a single population proportion; Normal distribution 

e ca test of two population means, independent groups; Normal 
distribution 

e da test of two population proportions, independent groups; 
Normal distribution 


Solution: 


D 


Questions 11 through 13 refer to the following: In a study of vehicle 
safety, 15 minivans were crash tested and the repair costs for each of the 15 
minivans were recorded. For these 15 minivans, the average repair cost was 
$1786 and the standard deviation was $937 (based on data from the 
Highway Loss Data Institute.) Suppose that you want to test the hypothesis 
that the average repair cost is under $2000. Assume that the underlying 
population of repair costs follows a normal distribution. 

Exercise: 


Problem: The correct null hypothesis for this test is 


e am? 1786 
e bm £ 2000 


e cm? 2000 
e dm < 2000 


Solution: 


C 
Exercise: 


Problem: 


At a 5% level of significance (a), the correct decision for this 
hypothesis test is 


e areject H, because a is more than the p-value. 

e breject H, because a is less than the p-value. 

e cdo not reject H, because a is more than the p-value. 
e ddo not reject H, because a is less than the p-value. 


Solution: 
D 
Exercise: 
Problem: The appropriate distribution for this test is 
¢ aNormal with standard deviation $937 
e bt with degrees of freedom = 15 


e ct with degrees of freedom = 14 
e dNormal with standard deviation $241.93 


Solution: 


c 


For problems 14 and 15: Buses on a particular route stop in front of De 
Anza College every 20 minutes between 3:00 p.m. and 1:00 a.m. The 
waiting times are equally likely. We asked the 33 people waiting at 6:45 
p.m. how long they had been waiting, and then calculated the average wait 
time for those people. 

Exercise: 


Problem: The distribution of the average wait times is: 


e aN( 10, 1. 0050 ) 
e bU(O, 20) 

e cN( 10, 5.7735) 
e dExp (1 20) 


Solution: 


A 
Exercise: 
Problem: 


The probability that the average wait time is no more than 15 minutes 
is 


eal 
b 0. 7500 
c 0. 7769 
e dO 


Solution: 


A 


Exercise: 


Problem: 


Which probability statement best describes the graph? The horizontal 
axis has the label Xbar. 


e a P( Xbar> 6) 

e b P( Xbar <3 ) 

e cP( Xbar>60rX <3) 
e dP( 3 < Xbar <6) 


Solution: 


D 


Questions 17 and 18 refer to the following: A radio news story claimed 
that half of all U.S. adults have Internet access. In a national poll about 
Internet usage (The Pew Internet Project), 12,638 U.S. adults were surveyed 
and it was found that 6413 of those surveyed had Internet access. At a 5% 
level of significance, perform a hypothesis test to test the claim made by the 
radio news story. 

Exercise: 


Problem: The hypothesis test is: 


¢ aright - tailed 
e b left - tailed 
e ctwo — tailed 
e dno-tailed 


Solution: 


C 


Exercise: 


Problem: The correct conclusion is: 


e a The percentage of U.S. adults that have Internet access is not 
one-half. 

¢ b more than half of all U.S. adults have Internet access. 

e chalf of all U.S. adults have Internet access. 

e dless than half of all U.S. adults have Internet access. 


Solution: 


C 


Questions 19 - 21 refer to the following: An organic fertilizer and a 
conventional chemical fertilizer are tested to determine if the organic 
fertilizer produces more blossoms per stalk. 43 pairs of seed (the pair of 
seeds come from the same parent plant) are treated. One seed is treated with 
the organic fertilizer and the other seed is treated with the conventional 
chemical fertilizer. After a growing season of identical watering and 
sunlight, the number of blossoms on each stalk in a matched pair is 
recorded. For each pair, the difference in the number of blossoms per stalk 
(organic fertilizer plant blossoms — conventional chemical fertilizer plant 
blossoms) is computed. _ The statistics are: xd = 2.2 sd =5.3n = 43 
Exercise: 


Problem: What type of hypothesis test is conducted? 


e a Test of a single population mean 

¢ b Test of two population proportions, independent groups 
e c Matched or Paired Samples 

e d Test of two population means, independent groups 


Solution: 


C 


Exercise: 


Problem: Select the appropriate alternate hypothesis. 


e apd£0 
¢ bud>0O 
e cpd> 2.2 
e dud£ 2.2 


Solution: 


B 


Exercise: 


Problem: What is the correct conclusion? 


e a The organic fertilizer does result in more blossoms per stalk 

e b The organic fertilizer does not result in more blossoms per 
stalk. 

e c The organic fertilizer results in the same number of blossoms 
per stalk 

e d The conventional chemical fertilizer does result in more 
blossoms per stalk 


Solution: 


A 


Problems 22 — 24 refer to the following: Based on data from the 2000 
Census the average age of a Baldwin County, Alabama resident is 39 years 
with a standard deviation of 22. 6 . The data is normally distributed. 
Exercise: 


Problem: The median age, in years, is: 


e a39 

e b 22.6 

e ¢c19.5 

e d. There is not enough information. 


Solution: 


A 
Exercise: 


Problem: 
If 100 residents are surveyed, the IQR for the average age is: 


e a0. 30 
e b30.5 
e €3.04 
e d.There is not enough information. 


Solution: 


C 


Questions 24 — 25 refer to the following: Students doing a statistics 
project at Central City College found that the amount of time a teenager 
spends cleaning his/her room each week is exponentially distributed with a 
mean of 20 minutes. 

Exercise: 


Problem: 


What is the probability that 50 randomly selected teenagers spend, on 
average, between 15 and 30 minutes cleaning their rooms each week? 


e a 0.7500 
¢ b0.2902 
e ¢c 0.2492 


e d0.9612 


Solution: 


D 
Exercise: 


Problem: 


Find the 70th percentile for the average amount of time 50 teenagers 
spend cleaning their rooms each week. 


e a70 
e b21.5 
e ¢ 30.5 
e d7.1 


Solution: 


B 


Elementary Statistics: Exam 1: Lessons 1 & 2 


Questions 1 — 2 refer to the following: 
A sample of 56 employees was taken to determine average pulse rate. The 
data is in the table below: 


Pulse Rate (beats per minute) Frequency (# of employees) 
54 1 
58 4 
65 7 
68 9 
72 10 
76 4 
80 10 
84 6 
90 3 
98 2 
Exercise: 


Problem: The mode(s) is/are: 


e A74 

e B72 and 80 

e C68 

e Dthe average of 72 and 80 


Solution: 
B 
Exercise: 
Problem: The variable is: 


e Athe number of employees 

e Ball employees 

e Cthe average pulse rate for the 56 employees 
e Dthe pulse rate of one employee 


Solution: 


D 


Questions 3 — 5 refer to the following: 
The table below shows the ages of 50 senior citizens who attend a local 
senior center. 


Age Frequency Relative Freq. Cum. Rel. Freq. 


36 9 0.18 


Age Frequency 


63 8 
68 17 
73 8 
80 5 
88 2 
Exercise: 


Problem: Find the IQR. 


e AO.5 
e B17 
e C10 


Relative Freq. 


0.36 


0.16 


0.10 


0.04 


e Dnot enough information 


Solution: 


C 


Exercise: 


Cum. Rel. Freq. 


1.00 


Problem: Which interval has the smallest percentage of data? 


e AThe interval 55.5 to 63.5. 
e BThe interval 63.5 to 68.5. 
e CThe interval 68.5 to 73.5. 
e DThe interval 73.5 to 88.5. 


Solution: 


D 


Exercise: 


Problem: What percent of the ages are at most 68? 


e A36 
¢ B70 
e C30 
e D66 


Solution: 


B 
Exercise: 
Problem: 


What can be said about a set of data when its standard deviation is 
zero? 


e Alf the data are ordered, they are very spread out from the mean. 

e BAI! of the data appear with the same frequency. 

e ClIf the data are ordered, they are very close to but different from 
the mean. 

e DThere is no mode. 


Solution: 


D 
Exercise: 


Problem: 


For the following data, which measure of central tendency would be 
the LEAST useful? Data: 


48 1111 11 27 27 27 27 27 1000 


e Amean 

e Bmedian 

e Cmode 

¢ Dstandard deviation 


Solution: 
A 
Questions 8 — 10 refer to the following: 


Sixty (60) college students were asked the number of units they are taking 
this quarter. The results are given in the following graph: 


Relative Frequency 


0.4 a 
0.3 ia 


0.2 


~ Ti ei 
Number 
10 11 12 13 14 15 16 of 
Units 
Exercise: 
Problem: 


The number of responses that were “12” OR “13” is approximately: 


e Al8 
e BO.3 
e C25 


e¢ DNot enough information 


Solution: 


A 


Exercise: 


Problem 


e Al5 
e B14 
e C13 
e D12 


: The third quartile is: 


Solution: 


‘s 


# of 
Videos 


0 


Frequency 
18 


26 


Relative 
Frequency 


0.29 


Cum. Rel. 
Freq. 


? 


# of Relative Cum. Rel. 


Videos Frequency Frequency Freq. 

4 | : ? 
Exercise: 

Problem: 


The sample from the table above was taken by randomly selecting one 
student from the administration’s official list of students and then 
choosing every 100th. This is an example of what kind of sampling? 


e Acluster 

e Bsystematic 
e Cstratified 

e Dconvenience 


Solution: 


B 
Exercise: 


Problem: 


How should you classify data from the following question: What is 
your blood pressure? 


e Aqualitative 


e Bquantitative — discrete 
e Cquantitative — continuous 


Solution: 


C 


Exercise: 


Problem: 


How should you classify data from the following question: What is 
your favorite vacation place? 


e Aqualitative 
e Bquantitative — discrete 
e Cquantitative — continuous 


Solution: 


A 
Exercise: 


Problem: 


A study is done to determine the average amount of tuition all San Jose 
State undergraduate students pay per semester. A sample of 100 
undergraduate San Jose State students is taken by dividing the students 
into freshmen, sophomore, junior, and senior years and selecting 25 
students from each. Each student is asked how much tuition he/she 
paid for the Spring 2002 semester. The type of sampling and the 
parameter are: 


e Acluster and the average amount of tuition all San Jose State 
students paid Spring 2002. 

e Bstratified and the average amount of tuition the 100 San Jose 
State students paid Spring 2002. 

¢ Ccluster and the average amount of tuition the 100 San Jose State 
students paid Spring 2002. 

e Dstratified and the average amount of tuition all San Jose State 
students paid Spring 2002. 


Solution: 


Elementary Statistics: Exam 2: Lessons 3 & 4 
Questions 1 — 4 refer to the following: 


The following table classified eighty-two children by age and favorite 
meals at a McDonald's ® restaurant. 


6 or below 7-12 13-15 Total 


Hamburger 6 12 16 34 
Chicken Nuggets 9 13 11 30 
Filet ‘o’ fish 2 5 8 15 
Totals 17 30 35 82 


Age (in years) against favorite foods 
Assume a child is randomly selected. 
Exercise: 

Problem: 


Find the probability of being “7-12” years old AND preferring 
“chicken nuggets” 


e AB 
° B22 
© C2 
e Dp 


Solution: 


A 
Exercise: 


Problem: 


Find the probability of being “13-15” years old OR preferring 
“Filet’o’fish”. 


Solution: 


D 
Exercise: 


Problem: 


Find the probability of “preferring Hamburger” given that the 
randomly selected child is 13-15 years old. 


Solution: 


B 


Exercise: 


Problem: 
The events “preferring Hamburger” and “being 13-15 years old” are: 


e AMutually exclusive 

e BlIndependent 

e CNeither mutually exclusive or independent. 
e DBoth mutually exclusive and independent. 


Solution: 


CG 
Exercise: 
Problem: 


E and F are two events such that P(E’) = 0.60, P(E or F’) = 0.90 
and P(E and F’) = 0.50. Find P(F). 


e A0.80 
e B0.30 
e C0.40 
e DO.10 


Solution: 


A 
Exercise: 
Problem: 
The probability that a randomly chosen adult resident of Bayview city 
owns a boat is 0.16. The probability that a randomly chosen adult rents 


an apartment is 0.30. The probability that the adult owns a boat given 
he/she rents an apartment is 0.20. 


e A0.048 
e BO0.24 
e C0.10 
e DO0.06 


Solution: 


D 


Exercise: 


Problem: Possessing a boat and renting an apartment are: 


e Aindependent events 

e¢ Bmutually exclusive 

e Cboth independent and mutually exclusive 

¢ Dneither independent nor mutually exclusive 


Solution: 


D 


Questions 8 — 9 refer to the following: 

A bag contains 4 red marbles and 5 blue marbles. Two marbles are 
randomly drawn without replacement. 

Exercise: 


Problem: 


Find the probability of the event “The first marble is red and the 
second is blue.” 


Solution: 


B 


Exercise: 


Problem: Find the probability that both marbles are red. 


Solution: 


C 
Exercise: 


Problem: 


Approximately 70% of U. S. adults had at least one pet as a child. We 
randomly survey 60 U. S. adults. We are interested in the number that 


had at least one pet as a child. The probability that at least 3 adults had 
at least one pet as a child means: 


e AP(X = 0)+ P(X =1)+ P(X = 2) 

e BP(X =0)+ P(X =1)4+ P(X = 2) + P(X = 3) 

© CP(X = 4) + P(X =5)+ P(X =6)+...+ P(X = 60) 

e DP(X = 3) + P(X = 4)+ P(X =5)+...+ P(X = 60) 
Solution: 


D 


Questions 11 — 12 refer to the following: 
A plumber has determined the possible number of house calls to be made 
each day, and their related probabilities: 


x =# of house calls P(x) 
0 0.10 
1 0.40 
2 0.25 
3 0.15 
4 0.10 
Exercise: 
Problem: 


What is the probability that he makes at least 1, but no more than 3 
house calls in a day? 


e A0.65 
e BO.80 
e €0.50 
e DO.40 


Solution: 


B 


Exercise: 
Problem: 


If the plumber charges a flat fee of $ 40 for a house call, the expected 
daily income is: 


° A$70 

e BS$175 
e C$400 
¢ D$1.75 


Solution: 


A 


Exercise: 


Problem: If X~B(40,0.2), then P(X > 11) = 


e A0.0432 
e BO.0001 
e €0.0875 
e DO.1608 


Solution: 


G 
Exercise: 
Problem: 
The Fizz—Full Soda Company knows that 4% of the bottles of soda it 
produces are filled with less soda than required. If one purchases 10 


bottles at random, the probability that at most 2 of these bottles will 
have less soda than required is: 


e A0.0519 
e B0.9938 
e €0.9418 
e DO0.0080 


Solution: 


B 


Questions 15 — 16 refer to the following: 

Assume the statistics final is a multiple-choice exam with 50 questions, 
each question having 5 choices, only one of which is correct. Assume you 
answer all questions at random (guessing). 

Exercise: 


Problem: 
The expected number of questions you would get correct would be: 


e AS 

e B10 
e C40 
e D45 


Solution: 


B 
Exercise: 
Problem: 


Based upon numerical calculations, would you be surprised if a person 
got exactly half of the questions correct? 


e Ayes, because it is impossible 
e Byes, because the probability is almost 0 
¢ Cno, because the probability is 0.50 


¢ Dno, because it is the most likely probability 


Solution: 


B 
Exercise: 


Problem: 


If sampling without replacement occurs, do the picks follow the 
Binomial Distribution? 


e AYes, because each pick is independent from the others. 

e BNo, because the probability of success on each pick changes. 
e CyYes, if we are counting the number of successes. 

¢ DNo, because we may not have any successes. 


Solution: 


B 
Exercise: 


Problem: 


Ninety-four percent of California community college transfers feel that 
their community college adequately prepared them to handle upper- 
division coursework at their transfer university. We randomly survey 
14 California community college transfers. We are interested in the 
number that feel that their community college adequately prepared 
them to handle upper division coursework at their transfer university. 
List the values that X , the Random Variable, may take on. 


© A1,2,3....,14 
° B1,2,3,...,94 
* CO,1,2,...,14 


¢ DO,1,2....,94 


Solution: 


C 


Elementary Statistics: Exam 3: Lessons 5, 6 & 7 


Questions 1 — 3 refer to the following: 

Assume the amount of money seventh—grade students spend on food each 
day at school is exponentially distributed with an average of $2.50. 
Exercise: 


Problem: Which graph best describes the distribution? 


eA 


Solution: 


B 
Exercise: 
Problem: 


Find the probability that a randomly selected seventh—grade student 
spends less than $4 a day on food. 


e A0.7981 
e BO.2019 
e €0.9999 
e DO.0001 


Solution: 


A 
Exercise: 


Problem: 


85% of the seventh—grade students spend more than what amount per 
day? 


e A$2.12 
¢ BS0.75 
° C$4.74 
¢ D$0.41 


Solution: 


D 


Questions 4 — 5 refer to the following: 

The amount of time that intermediate algebra students at Leland High 
School spend doing their homework per day is normally distributed with a 
mean 1.5 hours and standard deviation 0.75 hours. 

Exercise: 


Problem: 


If one student is randomly chosen, what is the probability that the 
student does intermediate algebra homework at least 2 hours per day? 


e A0.7475 
e BO0.4259 
e €0.2525 
e D0.6784 


Solution: 


GC 
Exercise: 
Problem: 


60% of these students spend at most how many hours doing their 
homework? 


e A1.69 hours 

e B1.31 hours 

e C1.5 hours 

e DO.2533 hours 


Solution: 


A 


Questions 6 — 7 refer to the following: 

Llamas are excellent pack animals. It is known that the weight of supplies 
carried by llamas follows a normal distribution with a mean of 62.5 pounds 
and a standard deviation of 6 pounds. 

Exercise: 


Problem: 


Find the probability that the weight of supplies carried by one 
randomly chosen llama is between 60 and 70 pounds. 


e A0.4441 
e BO0.5559 
e C0.8944 
« D1 


Solution: 


B 


Exercise: 


Problem: 


The middle 50% of weights of supplies carried by a randomly chosen 
llama is between and 


e AO and 62.5 pounds 

e B58.45 and 66.55 pounds 

e C56.5 and 68.5 pounds 

e¢ DThere is not enough information given. 


Solution: 


B 


Exercise: 


Problem: Which of the following are true for the normal distribution? 


¢ IMore values fall close to the mean than fall far away from the 
mean. 

e IThe mean and standard deviation cannot be the same. 

e IIA change in p causes the graph to shift to the left or right and 
changes the shape of the graph. 

e IVA change in s causes a change in the shape of the normal 
curve. 


e AI, IV 

e BI, Il, Ill, 1V 
e CI, I, Ill 

e DIIl, IV 


Solution: 


A 


Questions 9 — 13 refer to the following: 

The length of time junior high school students sleep per night follows an 
approximate uniform distribution from seven to eleven hours. Suppose we 
randomly select a junior high student. 

Exercise: 


Problem: 


Find the probability that the randomly selected student sleeps less than 
8 1/2 hours per night. 


e A.2143 
e BO.7727 
e €0.4705 
e DO0.375 


Solution: 


D 
Exercise: 


Problem: 


Find the probability that the randomly selected student sleeps eight to 
twelve hours per night. 


e AO 
e Bl 
e C0.75 
e DO.25 


Solution: 


C 


Exercise: 


Problem: 


On average, how long does a junior high school student sleep per 
night? 


e A.2143 
e¢ BO.7727 
e €0.4705 
e DO0.375 


Solution: 


B 
Exercise: 
Problem: 


On average, how long does a junior high school student sleep per 
night? 


e A9.6 hours 
e B6.5 hours 
e C7.8 hours 
e D8.4 hours 


Solution: 


D 
Exercise: 
Problem: 
We are interested in the probability that a randomly selected student 


sleeps less than eight hours, knowing that he/she sleeps less than ten. 
Which graph best depicts this situation? 


7 10 
e D 
ot 
3 
7 8 10 
Solution: 


C 


Elementary Statistics: Exam 4: Lessons 8, 9 & 10 


e Each question has exactly one best answer. 
e Each question is worth 3 points for a total of 54 points (4 bonus points possible). 


Questions 1 — 3 refer to the following: 

Last year, President Bush granted a tax-cut check to all income tax filers. In doing so, it was reported 
that he thought that at least 30% of the households would use the tax-cut check to increase spending. 
According to a report by the University of Michigan Research Center (Wall Street Journal, Feb. 26, 
2002), 220 of the 1000 people surveyed said that the 2001 tax-cut check they received has led them to 
increase spending. 

Exercise: 


Problem: The alternate hypothesis for this test is: 


¢ Ap>0.30 
¢ Bp>0.22 
¢ Cp>0.30 
° Dp>0.22 


Solution: 


Cc 


Exercise: 


Problem: A Type II error is to: 


e AConclude that the proportion of people that would use the tax-cut check to increase 
spending is less than 30% when in fact the proportion is at least 30%. 

¢ BConclude that the proportion of people that would use the tax-cut check to increase 
spending is at least 30% when in fact the proportion is less than 30%. 

e CConclude that the proportion of people that would use the tax-cut check to increase 
spending is less than 22% when in fact the proportion is at least 22%. 

e¢ DConclude that the proportion of people that would use the tax-cut check to increase 
spending is at least 22% when in fact the proportion is less than 22%. 


Solution: 


B 


Exercise: 


Problem: Which of the following is the correct decision to make for the test? 


e AReject the null hypothesis. 
e BDo not reject the null hypothesis. 
e CThe test is inconclusive. 


Solution: 


Cc 


Questions 4 — 6 refer to the following: 

According to an article by George Will (San Jose Mercury News, Feb. 28, 2002), the average U.S. 
consumption per person per year of French Fries is 28 pounds. Suppose that you believe that the 
average in Santa Clara County is not 28 pounds. You randomly survey 50 people in this county. The 
sample average is 24 pounds with a sample standard deviation of 10 pounds. Conduct an appropriate 
hypothesis test. 

Exercise: 


Problem: This test is: 


e Aleft-tailed 
e Bright-tailed 
e Ctwo-tailed 
e Dno-tailed 


Solution: 


C 


Exercise: 


Problem: The p-value for this test is: 


e A0.0068 
¢ BO.0034 
e¢ €0.0047 
¢ DO0.0136 


Solution: 


A 


Exercise: 


Problem: At the 5% level, the correct conclusion is: 


e AThe average consumption in Santa Clara County is less than 28 pounds. 
e BThe average consumption in Santa Clara County is not 28 pounds. 

e CThe average consumption in Santa Clara County is less than 24 pounds. 
e DThe average consumption in Santa Clara County is 24 pounds. 


Solution: 


B 


Questions 7 and 8 refer to the following: 

A hospital administrator wants to determine the proportion of emergency room patients that use the 
emergency room (ER) for non-emergency care. She randomly samples records from 350 ER patients 
and determines that 74 of those patients required only non-emergency care. 

Exercise: 


Problem: 


The administrator constructs a 90% confidence interval for the true proportion of all ER patients 
who use the ER for non-emergency care. The error bound for the proportion EBP for this 
confidence interval is: 


e A0.036 
e BO.072 
e¢ €0.030 
e DO.106 


Solution: 


A 
Exercise: 


Problem: 


If the same data were used but the confidence level used was 95% instead of 90%, the error 
bound for the proportion EBP would be: 


e Alarger 

e Bthe same 

e Csmaller 

e Dwe are unable to determine unless another sample is obtained 


Solution: 


A 
Exercise: 


Problem: 
What is meant by the term “95% confident” when constructing a confidence interval for a mean? 


e Alf we took repeated samples, approximately 95% of the samples would produce the same 
confidence interval. 

¢ BIf we took repeated samples, approximately 95% of the confidence intervals calculated 
from those samples would contain the sample mean. 

¢ CIf we took repeated samples, the sample mean would equal the population mean in 
approximately 95% of the samples. 

¢ DIf we took repeated samples, approximately 95% of the confidence intervals calculated 
from those samples would contain the true value of the population mean. 


Solution: 


D 


Questions 10 through 11 refer to the following: 

An aircraft manufacturer is testing a new procedure to be used in installing a certain component in an 
aircraft. For a random sample of 8 airplanes being assembled, the time (in minutes) required to install 
the component for each of these 8 aircrafts are: 


808487919195102106 


Assume the underlying population of installation times is approximately normally distributed. 
Exercise: 


Problem: 
Find the 90% confidence interval for the true mean installation time using this new procedure. 


e A(86.1, 97.9) 
¢ B(86.9, 91.1) 
¢ C(87.2, 96.8) 
e DNot enough information 


Solution: 


A 


Exercise: 


Problem: The value that is the center of the confidence interval is: 


e A91 
e B92 
e C93 
e Du 


Solution: 


B 
Exercise: 
Problem: 
The amount of soda contained in a can for a certain brand of soda is normally distributed with a 
population standard deviation of 0.1 ounces. A random sample of 40 cans of soda was selected 


and the amount of soda in each can was measured. The sample mean was 12.03 ounces and the 
sample standard deviation was 0.08 ounces. 


What is the appropriate distribution to use when calculating a confidence interval for the true 
mean amount of soda contained in all cans of this brand? 


e AThe student-t distribution, because the sample standard deviation is given. 

e BThe student-t distribution, because the repair costs are approximately normally distributed. 
e CThe standard normal distribution, because the population standard deviation is known. 

e DThe standard normal distribution, because the sample mean is known. 


Solution: 


G 


Questions 13 — 15 refer to the following: 

Two competing parcel-delivery firms in a large city make conflicting claims about which one delivers 
parcels in the shortest time. A random sample of 100 delivery times for the first company produces a 
sample mean of x1 = 37 minutes and a sample standard deviation of s1 = 10 minutes. A random 
sample of 100 delivery times for the second company produces a sample mean of x2 = 41 minutes and 
a sample standard deviation of s2 = 12 minutes. Conduct a hypothesis test to determine if the mean 
delivery time for the first company is less than that for the second company. 

Exercise: 


Problem: 


What is the appropriate distribution to use when calculating a confidence interval for the true 
mean amount of soda contained in all cans of this brand? 


e Ami < m2 
e Bml1 > m2 
e Cm1 £m2 
e Dmi1 2 m2 


Solution: 
D 
Exercise: 


Problem: The exact distribution for the hypothesis test is: 


e AThe normal distribution because the population standard deviations are given. 

e BThe student-t distribution because the population standard deviations are unknown. 
e CThe exponential distribution because the time of delivery decreases. 

e¢ DNot able to determine. 


Solution: 


B 


Exercise: 


Problem: f the p-value is 0.0056, the conclusion is: 


e AThe mean delivery time for the first company is higher than the mean delivery time for the 
second company. 

e BThe mean delivery time for the first company is no more than the mean delivery time for 
the second company. 

e CThe mean delivery time for the first company is at least equal to the mean delivery time for 
the second company. 

e DThe mean delivery time for the first company is less than the mean delivery time for the 
second company. 


Solution: 
D 
Questions 16 — 17 refer to the following: 


Participants in a random sample of 10 professional football players are placed on a yogurt-and-banana 
diet for one month. The weights before and after one month on the diet are as follow: 


Before 187 205 165 193 199 286 212 189 242 253 


After 175 193 167 190 197 240 210 189 221 255 


We want to determine if the yogurt-and-banana diet helps reduce the weight of the football players. 
Assume the weight of the professional football players is approximately normally distributed. 
Exercise: 


Problem: This is a test of: 


e ATwo independent population means, population standard deviations known. 

e BTwo independent population means, population standard deviations unknown. 
e CPaired or matched samples. 

e DTwo population proportions. 


Solution: 


C 


Exercise: 


Problem: The distribution for the hypothesis test is: 


e AStudent-t 
e BExponential 
e CNormal 


DUniform 


Solution: 


A 


Exercise: 


Problem: 


A newspaper/TV network survey was conducted to determine whether the percentage of adult 
males who favor the death penalty is greater than the percentage of adult females who favor the 
death penalty. A random sample of 800 adult males produced 480 who favor the death penalty., 
while a random sample of 800 adult females produced 410 who favor the death penalty. 


The Type I error is to: 


AConclude that the percentage of adult males who favor the death penalty is greater than the 
percentage of adult females who favor the death penalty when, in fact, the percentage of 
males is no more than the percentage of females. 

BConclude that the percentage of adult males who favor the death penalty is no more than 
the percentage of adult females who favor the death penalty when, in fact, the percentage of 
males is more than the percentage of females. 

CConclude that the percentage of adult males who favor the death penalty is greater than the 
percentage of adult females who favor the death penalty when, in fact, the percentage of 
males is actually greater than the percentage of females. 

DConclude that the percentage of adult males who favor the death penalty is no more than 
the percentage of adult females who favor the death penalty when, in fact, the percentage of 
males is actually no more than the percentage of females. 


Solution: 


A 


Elementary Statistics: Exam 5: Lesson 11 
Exercise: 


Problem: 


Suppose the random variable X follows a chi-square distribution with 
degrees of freedom equal to 35. Fill in the blanks. 


Am 
Bs 


Solution: 


Az OD 


B. 8.3666 


Exercise: 


Problem: Check all that apply. 


AThe chi-square graph always has the same shape. 

BIf X follows a chi-square distribution with df = 200, then X 
approximately follows a normal distribution. 

CThe chi-square distribution is skewed to the right if the degrees 
of freedom are less than 90. 

DThe test statistic for the chi-square distribution may be zero. 
EA goodness-of-fit hypothesis test is always right-tailed. 

FA test of independence tests whether two factors are 
independent or not. 


Solution: 


B, C, D, and F 


Exercise: 


Problem: Write the null and alternate hypotheses for the following: 


It is believed that public high school students attend school in equal 
numbers for each day of the school week. Suppose a sample of the 
days students were present at school was taken for one particular high 
school: 


e 1750 students were present on Monday 

e 1800 students were present on Tuesday 

e 1840 students were present on Wednesday 
e 1810 students were present on Thursday 

e 1800 students were present on Friday 


Ho: 
Ha: 
Solution: 


Ho: Public high school students attend school in equal numbers for 
each day of the school week 


Ha: Public high school students DO NOT attend school in equal 
numbers for each day of the school week 


Exam 6: Chapter 12 
Practice final exam for use with Collaborative Statistics (col10522) by 
Barbara Illowsky and Susan Dean. 


Questions 1 — 3 refer to the following: 

New snowboarders (those who have snowboarded a year or less) often 
suffer from minor injuries. A random sample of seven new snowboarders 
produced the data on number of months snowboarding and number of 
minor injuries in the last month that they snowboarded. 


Months Snowboarding # of injuries in the last month 
2 9 

4 6 

8 3 

12 2 

1 9 

5 6 

9 2 

Exercise: 
Problem: 


Is the correlation between the number of months snowboarding and the 
number of injuries in the last month snowboarding significant? 


e AYes 
¢ BNo 
e CNot enough information to answer question 


Solution: 


A 


Exercise: 


Problem: The linear regression equation is: 


e Ayhat = 9.5904 — 0.9614 x 
e Byhat = -0.7349 + 9.5904z 
¢ Cyhat = 9.5904 + 0.7349z 
¢ Dyhat = 9.5904 — 0.73497 


Solution: 


D 
Exercise: 


Problem: 


If a new snowboarder has snowboarded for five (5) months, how many 
injuries would s/he have in the last month snowboarding? 


e A4.8 
e B47.2 
e €5.9 
e D13.3 


Solution: 


@ 


Exercise: 
Problem: 
If you calculate the line of best fit and the independent variable and the 


dependent variable have negative correlation, then the line of best fit 
has slope zero (0). 


e ATrue 
e BFalse 
e CThere is not enough information 


Solution: 


B 


Questions 5 — 6 refer to the following: 
The cost of a leading gourmet ice cream in different sizes is given in the 
table. 


Size (ounces) Cost 
16 $4.29 
32 $7.36 
64 $12.80 
96 $17.28 


Exercise: 


Problem: Are there any outliers? 


e AYes 
¢ BNo 
e CNot enough information 


Solution: 


B 
Exercise: 


Problem: 


If your friend used the line of best fit to predict the cost for a 128- 
ounce size of gourmet ice cream, what would you tell him/her with 
what you have learned about linear regression? 


e AThat a line is not the best fit for the cost of the gourmet ice 
cream. 

e BThat the 128-ounce size of the gourmet ice cream is far too 
expensive. 

e CThat we should switch the independent and dependent variables. 

e DThat s/he should not use the line of best fit to make this 
prediction. 


Solution: 


D 


Elementary Statistics: Quiz 1: Sampling and Data 
Exercise: 


Problem: 
A study is done to determine the average amount of television watched 


per week by California children. What is the phrase “average amount 
of television watched per week by California children” describing? 


e Adata 

e Bstatistic 

e Cparameter 
e Dvariable 


Exercise: 
Problem: 


How should you classify data from the following: What are the names 
of colleges you have attended? 


e Aquantitative - continuous 
e Bquantitative - discrete 
e Cqualitative 


Exercise: 
Problem: 


How should you classify data from the following: What is the number 
of times during the summer that you go to the beach? 


e Aquantitative - continuous 


e Bquantitative - discrete 
e Cqualitative 


Exercise: 


Problem: 


How should you classify data from the following: What is your 
weight? 


e Aquantitative - continuous 
e Bquantitative - discrete 
e Cqualitative 


Exercise: 


Problem: 


A study is done to determine the average salary for all Santa Clara 
County nurses. If we were to pick the Valley Medical Center, Good 
Samaritan Hospital, and Saint Louise Hospital in Santa Clara County 
and average together the salaries of all nurses, would this be a good 
sampling technique or a bad sampling technique? 


Exercise: 


Problem: 


A study is done to determine the average amount of television watched 
per week by California children. What is the phrase "the average 
amount of television watched per week by California children in a 
random survey" describing? 


e Asample 
e Bvariable 
e Cstatistic 
e Ddata 


Exercise: 


Problem: 


A study is done to determine the average amount of television watched 
per week by California children. Three of the California children 
watch two hours, 1 hour, and one-half hour, respectively. "Two hours, 
1 hour, and one-half hour" are the: 


e Adata 

e Bstatistic 

e Cparameter 
¢ Dsample 


Exercise: 
Problem: 


Determine the sampling technique: A medical researcher does a 
random survey of 100 female doctors and 100 male doctors. 


e Astratified 

e Bsystematic 

e Csimple random 
e Dcluster 


Exercise: 
Problem: 
Sixty-three people were asked the number of movies they saw in the 


movie theaters over their last vacation. Which of the following 
columns do we NOT add up? 


e Afrequency 


e Brelative frequency 
¢ Ccumulative relative frequency 


Exercise: 


Problem: 


Sixty-three people were asked the number of movies they saw in the 
movie theaters over their last vacation. Twenty-two people saw no 
movies, fourteen people saw one movie, ten people saw two movies, 
seven people saw three movies, six people saw four movies, and four 
people saw 5 or more movies. What percent of people saw 1 or 2 
movies? 


e Aabout 19% 
e Babout 38% 
e Cabout 60% 
e¢ Dabout 89% 


Elementary Statistics: Quiz 2: Descriptive Statistics 
Exercise: 


Problem: 


The following table shows the lengths of 64 international phone calls 
using a $5 prepaid calling card. 


Cumulative 
Data Relative Relative 
(minutes) Frequency Frequency Frequency 
4 25 0.3906 
14 15 
24 10 0.1563 
34 9 0.1406 
44 4 0.0625 
54 di 0.0156 1.0000 


Frequency of Phone Call Lengths 
Using the data, determine which ONE of the answers is correct: 


e AThe mean and the median are equal. 
e BThe mean is smaller than the median. 
e CThe mean is larger than the median. 


Exercise: 


Problem: Interpret the following box plot: 


Which of the following is correct? 


e A75% of the data are at most 5. 

e BThere is about the same amount of data from 2-5 as there is 
from 5-7. 

e CThere are no data values of 3. 

e D50% of the data are 4. 


Exercise: 


Problem: 
In a set of data, if all of the data appear with the same frequency, then: 


e AThe standard deviation is always 0. 

e BThe mean is always larger than the standard deviation. 
e CAII of the data have the same value. 

e DThe boxplot does not always look symmetrical. 


Exercise: 
Problem: 


The following table shows the lengths of 64 international phone calls 
using a $5 prepaid calling card. 


Data 
(minutes) 


4 
14 
24 
34 
44 


04 


Frequency of Phone Call Lengths 


Frequency 
25 
15 


10 


1 


Find the 60th percentile. 


e Al4 
e B60 
e C15 


e D0.2344 


Exercise: 


Relative 
Frequency 


0.3906 


0.1563 
0.1406 
0.0625 


0.0156 


Problem: Consider the following data set: 


46612 18 18 18 200 


Cumulative 
Relative 
Frequency 


1.0000 


What value is 2 standard deviations above the mean? 


e AThere is not enough information 
e BApproximately -98 
e CApproximately 102 
¢ DApproximately 169 


Exercise: 


Problem: Consider the following data: 
14 16 16 22 25 38 38 38 38 2000 
Which of the measures of central tendency would be the least useful? 


e Amean 
e Bmode 
e Cmedian 


Exercise: 
Problem: 


The following table shows the lengths of 64 international phone calls 
using a $5 prepaid calling card. 


Cumulative 
Data Relative Relative 
(minutes) Frequency Frequency Frequency 
4 25 0.3906 
14 15 
24 10 0.1563 


34 9 0.1406 


Cumulative 


Data Relative Relative 
(minutes) Frequency Frequency Frequency 
44 4 0.0625 

54 1 0.0156 1.0000 


Frequency of Phone Call Lengths 
What percent of the data is either 34 or 44 minutes? 


e AApproximately 78% 
e BApproximately 13% 
e CApproximately 98% 
¢ DApproximately 20% 


Questions 8 — 10 refer to the following: 
Sixty-four faculty members were asked the number of cars they owned 
(including spouse's and children's cars). The results are given in the 


histogram below: 
frequency 


0.45 


0 1 2 3 4 5 6 7 humber of cars 
Exercise: 


Problem: 
The number of responses that were either "1" or "3" is approximately: 


e A0.4 
¢ B27 


e C40 
e D2 


Exercise: 


Problem: Which of the following DOES NOT describe the data? 


e Athere are approximately 10 faculy members that own 7 cars 
e Bskewed left 

e CThere are no values of 5 

e Dskewed right 


Exercise: 


Problem: The third quartile is: 


e Al 
e B2 
e €3 
e DO.75 


Elementary Statistics: Quiz 3: Probability Topics 
Exercise: 


Problem: Given: 


¢ P(A) =0.4 
¢ P(B) = 0.6 
¢ P(A ORB) =0.91 


Find P(A[B). 


e A0.24 

e BO.4 

e €0.15 

e DCannot be determined. 


Exercise: 
Problem: 


If J and K are INDEPENDENT events, then which of the following is 
TRUE? 


e AP(J OR K) = P(J) + P(K) - P(J)*P(K) 
e BP(J AND K) =0 

e CJ and K are mutually exclusive events. 
¢ DP(J OR K)=0 


Exercise: 


Problem: 


A poll concerning credit cards found that 35 percent of respondents 
use a credit card that gives them one mile of air travel for every dollar 
they charge. Thirty percent of the respondents charge more than $2000 
per month. Of those respondents who charge more than $2000, 80 
percent use a credit card that gives them one mile of air travel for 
every dollar they charge. What is the probability that a randomly 
selected respondent expected to spend more than $2000 AND use a 
credit card that gives them a mile of air travel for every dollar they 
charge? 


¢ A(0.80)(0.35) 
¢ B(0.80)(0.30) 
* €(0.30)(0.35) 
* DO0.80 


Exercise: 


Problem: 
If P(J) = 0.3, P(K) = 0.6, and J and K are independent events, then: 


e AP(J and K) = 0 

e BP(J OR K) =0 

e¢ CP(J) does not equal P(J|K). 
e DP(J OR K) = 0.72 


Exercise: 


Problem: When sampling WITHOUT replacement, the second pick is 


e Aindependent from the first pick. 

¢ Ba complement of the first pick. 

e¢ Cmutually exclusive from the first pick. 
¢ Ddependent on the first pick. 


Exercise: 


Problem: Given: 


¢ P(A) =0.4 
¢ P(B) = 0.6 
¢ P(A ORB) = 0.65 


Find P(A AND B) 


e A0.35 
e Bl 

e €0.05 
e DO0.24 


Exercise: 


Problem: 


In a survey at a California ski resort, 100 skiers and snowboarders of 
different age groups were surveyed. The information is shown in the 
table. Suppose one person is randomly selected. Find the probability 
that the person was a skier OR was age 11 - 20. 


0-10 11-20 21-40 40+ 
Ski 10 12 30 8 
Snowboard 6 17 12 5 


Age of Participants by Sport 


e A77/100 


e B60/100 
e €12/100 
e D89/100 


Exercise: 


Problem: 


A poll concerning credit cards found that 35 percent of respondents 
use a credit card that gives them one mile of air travel for every dollar 
they charge. Thirty percent of the respondents charge more than $2000 
per month. Of those respondents who charge more than $2000, 80 
percent use a credit card that gives them one mile of air travel for 
every dollar they charge. What does 0.80 represent? 


e AP(air miles AND charge over $2000 per month) 
e BP(charge over $2000 per month | air miles) 

e CP(air miles OR charge over $2000 per month) 

e DP(air miles | charge over $2000 per month) 


Exercise: 
Problem: 


In a survey at a California ski resort, 100 skiers and snowboarders of 
different age groups were surveyed. The information is shown in the 
table. Suppose one person is randomly selected. Find the probability 
that the person was a snowboarder GIVEN he/she was age 21 - 40. 


0-10 11-20 21-40 40+ 


Ski 10 12 30 8 


0-10 11-20 21-40 
Snowboard 6 17 12 


Age of Participants by Sport 


e A40/100 
e B12/100 
e €12/40 
e D89/42 


Exercise: 


Problem: Given: 


¢ P(A) =0.4 
¢ P(B) = 0.6 
¢ P(A ORB) = 0.65 


Which of the following is FALSE? 


e AA and B are mutually exclusive. 

e BP(A OR B) = P(A) + P(B) - P(A AND B) 
e CP(A) = P(B’') 

e DA and B are dependent events. 


40+ 


Elementary Statistics: Quiz 4: Discrete Distributions 


Exercise: 
Problem: 

Xx P(X=x) 
0 1 

1 2 

y) ? 

4 3 

6 3 


Using the table, find the expected value. 


A3.3 
B3.5 
C3.6 
D3.4 


Exercise: 
Problem: 
The following table shows the probability distribution function for the 


number of high school years that students at a local high school study a 
single foreign language. 


x P(X=x) 


0 0.12 
1 0.08 
2 

3 0.34 
4 0.14 


Over the long run, the average number of years that we would expect 
students at this high school to study a single foreign language is: 


e A2.5 
e B3 
e C2 
e D2.3 


Exercise: 


Problem: 


x P(X=x) 


x P(X=x) 


Using the table, find the probability that X is greater than 1. 


e AO.1 
¢ BO.7 
e C0.3 
e D0.6 


Exercise: 


Problem: 


Approximately 70% of U.S. adults had at least one pet as a child. We 
randomly survey 60 U. S. adults. We are interested in the number that 
had at least one pet as a child. Does X follow a Binomial Distribution? 


e ANo, because not all children can have pets. 
e BNo, because 30% did not have pets. 

e CYes, because each adult has a 70% chance. 
e DYes, because most children like pets. 


Exercise: 
Problem: 
The following table shows the probability distribution function for the 


number of high school years that students at a local high school study a 
single foreign language. 


x P(X=x) 


0 12 
1 08 
2 

3 34 
A 14 


If 10 graduating students from this high school were randomly 
selected, would it be unusual for all 10 of them to have studied exactly 
3 years of a foreign language? 


e ANo, because each person is independent of the other. 

e BYes, because the probability for a student is 0.34, which is less 
than one-half. 

e CNo, because 3 is the mode. 

e DYes, because the probability is close to zero. 


Exercise: 
Problem: 


If sampling without replacement occurs, do the picks follow the 
Binomial Distribution? 


e AYes, if we are counting the number of successes. 

e BNo, because the probability of success on each pick changes. 
e CyYes, because each pick is independent from the others. 

e DNo, because we may not have any successes. 


Exercise: 


Problem: 


The following table shows the probability distribution function for the 
number of high school years that students at a local high school study a 


single foreign language. 


x P(X=x) 
0 a2 

1 08 

2 

3 34 

4 14 


What is the probability that X=2? 


e A0.42 
e BO.32 
e Cunable to determine 
e DO.20 


Exercise: 


Problem: 


Approximately 70% of U.S. adults had at least one pet as a child. We 
randomly survey 60 U. S. adults. We are interested in the number that 
had at least one pet as a child. The probability that at least 3 adults had 
at least one pet as a child means: 


¢ AP(X=0)+P(X=1)+P(X=2)+P(X=3) 
¢ BP(X=0)+P(X=1)+P(X=2) 

© CP(X=4)+P(X=5)+P(X=6)+ ... 

¢ DP(X=3)+P(X=4)+P(X=5)+ ... 


Exercise: 


Problem: 


Ninety-four percent of California community college transfers feel that 
their community college adequately prepared them to handle upper- 
division coursework at their transfer university. We randomly survey 
14 California community college transfers. We are interested in the 
number that feel that their community college adequately prepared 
them to handle upper division coursework at their transfer university. 
List the values that X, the Random Variable, may take on. 


0, 1, 2, ..., 94 
ab, 2 dy snp de 
0, 1, 2, ..., 14 
D2 Dy. ny 4 


Exercise: 


Problem: 


Ninety-four percent of California community college transfers feel that 
their community college adequately prepared them to handle upper- 
division coursework at their transfer university. We randomly survey 
14 California community college transfers. We are interested in the 
number that feel that their community college adequately prepared 
them to handle upper division coursework at their transfer university. 
Find the probability that more than half felt that they were prepared. 


e A0.00 
e B1.00 
e €0.53 
e D6.59 


Elementary Statistics: Quiz 5: Continuous Random Variables 
Exercise: 


Problem: 


Given that X ~ U(3, 15), find the probability that X is between 12 and 
14. 


e A2/12 
e B12/14 
e C2/15 
e D2 


Exercise: 


Problem: 


Given that X ~ U(3, 15), find the probability that X is greater than 5 
given that X is less than 10. 


e A5/12 
e B5/10 
e C5/7 

e D10/7 


Exercise: 


Problem: Given that X ~ U(3, 15), find the 70th percentile. 
e AO.70 
e BS.4 


e C11.4 
e D10.5 


Exercise: 


Problem: Given that X ~ U(3, 15), f(x) = 


e Al/15 
e B1/18 
e C1/12 
e Dil 


Exercise: 


Problem: 
Given that X ~ U(3, 15), what percent of the area is between 1 and 4? 


e A8 1/3 

e B25 

« C20 

e Dunable to determine 


Exercise: 


Problem: The probability that "X is at most 3" is interpreted as: 


¢ AP(X < 3) + P(X=3) 
¢ BP(X < 3) 

¢ CP(X > 3) 

¢ DP(X > 3) + P(X =3) 


Exercise: 


Problem: Given that X ~ Exp(0.4), find the decay rate. 


e A0.4 
e B1/0.4 
e C4 

e D2.5 


Exercise: 


Problem: Given that X ~ Exp(0.4), find the median. 


e A2.5 
e B1.25 
e C1.73 
e D1/4 


Exercise: 


Problem: 


X follows an Exponential Distribution with an average of 12. Which of 
the following is true? 


e AThe median is 12. 

e BThe decay rate is 12. 

e CThe mode is 12. 

e DThe standard deviation is 12. 


Exercise: 


Problem: 


X follows a Uniform Distribution from 2 to 5. What can be said of the 
median? 


e Alt is the same as the standard deviation. 
e BBoth (1) and (2) are correct. 

e Cit is the same as the average. 

e Dit is less than the average. 


Elementary Statistics: Quiz 6: The Normal Distribution 
Exercise: 


Problem: Given X ~ N (300, 15), find P(280 < X < 340). 


e A0.2000 
BO.7460 
C0.9050 
e D0.9999 


Exercise: 


Problem: Given X ~ N(300, 15), find the 70th percentile. 


e A0.5244 
e B210.00 
e €307.87 
¢ D324.96 


Exercise: 


Problem: Given X ~ N(300, 15), what can be said of the median? 


e AThe mode = median and the mode = the mean. 
e Blt is the same as the mode. 

e Cit is less than the average. 

e DIt is the same as the average. 


Exercise: 


Problem: Given X ~ N(300, 15), the area to the right of x = 330... 


e Ais the same as the area to the right of z = 2. 
e¢ BP(X > 330) is the same as the area to the right of z = 2 AND to 
the left of X = 270. 


e Cis the same as the area to the left of x = 270. 
e Dis the same as the area to the left of x = 330. 


Exercise: 


Problem: Given X ~ N(300, 15), find the Interquartile Range (IQR). 


e A0.50 

e B310.12 
¢ €C20.24 
e D289.99 


Exercise: 


Problem: 
Given X ~ N(300, 15), find the z-score associated with x = 290. 


e A15/10 
e B-10 

e €C10/15 
e D-10/15 


Exercise: 


Problem: For what do we use z-scores? 


e ABecause of the use of technology, there is no longer any use for 
Z-SCOres. 

e Bto standardize scores from two or more different normal 
distributions so that we may compare the scores. 

e Cto help us calculate uniform and exponential probabilities. 

e Dto make our calculations easier because the mean = 0. 


Exercise: 


Problem: Given that X ~ N(10, 2), X > 20... 


e Acannot happen. 

¢ Bcan happen 1/5 of the time. 

e Cis very unlikely to occur. 

e Dwe cannot determine its probability. 


Exercise: 
Problem: 


Given that X ~ N(10, 2) and Y follows the Exponential Distribution 
with a mean of 10, which of the following are correct? 


e AThe median for Y is greater than the median for X. 
¢ BThe median for Y is less than the median for X. 

e CThe percentiles for X and Y are also equal. 

e DThe median for X and Y are also equal. 


Exercise: 
Problem: 


For data that is normally distributed, is it possible for the standard 
deviation to be larger than the mean? 


e ANo. 
e BYes. 
e CThere is not enough information to determine. 


Elementary Statistics: Quiz 7: The Central Limit Theorem 
Exercise: 


Problem: 


Given that X ~ N(300, 15), we survey 20 at a time and are interested in 
the distribution of X-bar. As the number of data values averaged 
together gets larger, what happens to the standard error of the mean 
(the standard deviation of X-bar)? 


e Alt becomes larger. 

e Blt becomes smaller. 

e Clt approaches the average of X-bar. 

e Dit approaches the standard deviation of X. 


Exercise: 


Problem: 


Given X ~ Exp(1/8), we survey 100 at a time and are interested in the 
distribution of X-bar. Find the standard deviation of X-bar. 


e A0.8 

e BO.0125 
e €C0.0800 
e DO.125 


Exercise: 


Problem: 


Given X ~ Exp(1/8), we survey 100 at a time and are interested in the 
distribution of X-bar. Find the third quartile of X-bar. 


e AG 

e B0.6321 
e €8.54 
e D2.30 


Exercise: 


Problem: 


Given that X ~ N(300, 15), we survey 20 at a time and are interested in 
the distribution of X-bar. Find the 33rd percentile of the distribution 
for X-bar. 


e A298.6 
e B293.5 
e C301.4 
e D100.0 


Exercise: 
Problem: 


Given that X ~ N(300, 15), we survey 20 at a time and are interested in 
the distribution of X-bar. Find the z-score associated with x-bar = 290. 


e A-10/15 

e B-10 

e C-2.9814 
e D-13.3333 


Exercise: 
Problem: 


Given X ~ Exp(0.125), we survey 100 at a time and are interested in 
the distribution of X-bar. P(X-bar < 7)= 


e A0Q.1056 
e B0O.5831 
e €0.9987 
e DO.0001 


Exercise: 


Problem: 


Given X ~ U(2,18), we survey forty at a time and are interested in the 
distribution of X-bar. The standard deviation of X-bar is: 


e Al.5811 
e BO.1155 
e €0.7303 
e D4.62 


Exercise: 
Problem: 


Given X ~ U(2,18), we survey forty at a time and are interested in the 
distribution of X-bar. P(X < 4) is P(X-bar < 4). 


e Aless than 

¢ Bequal to 

e Cgreater than 
e Dat most 


Exercise: 
Problem: 


Given X ~ U(2,18), we survey forty at a time and are interested in the 
distribution of X-bar. P(2 < X < 18) is P(2 < X-bar < 18) 


e Aless than 

¢ Bequal to 

e Cgreater than 
e Dat most 


Exercise: 


Problem: 


Given that X ~ N(300, 15), we survey 20 at a time and are interested in 
the distribution of X-bar. What can be said about the median of the 
random variable X-bar? 


e Alt is the same as the median of X. 

e Blt is the same as the average of X-bar. 
e Cit is the same as the average of X. 

e DA, B, C are all correct. 


Elementary Statistics: Quiz 8: Confidence Intervals 
Exercise: 


Problem: 


When constructing a confidence interval for the population mean, the 
case where the population standard deviation is known is 


e Apossible, and the most common 
e Bpossible, but not common 

e Cpossible, and quite common 

¢ Dimpossible 


Exercise: 
Problem: 
When constructing a Confidence Interval, which case is preferable? 


e Alarge confidence level, small confidence interval 
e Bsmall confidence level, small confidence interval 
e Csmall confidence level, large confidence interval 
e Dlarge confidence level, large confidence interval 


Exercise: 


Problem: Which of the following is NOT a point estimate? 


e Ax-bar 

e Bp-hat or p-prime 
e Cs 

e Dsigma 


Exercise: 


Problem: 


When should the Student-t distribution be used in constructing a 
confidence interval for the population mean? 


e AWhen the population proportion is known and the underlying 
population is normally distributed. 

e BWhen the population standard deviation is unknown and the 
underlying population is normally distributed. 

e CWhen the population proportion is unknown and the underlying 
population is normally distributed. 

e¢ DWhen the population standard deviation is known and the 
underlying population is normally distributed. 


Exercise: 


Problem: 


When constructing a Confidence Interval for the population mean and 
both the sample standard deviation and the population standard 
deviation are known, which of the following should be used? 


e Anormal distribution using the sample standard deviation 

e¢ Bnormal distribution using the population standard deviation 

e Cstudent-t distribution using the population standard deviation 
e Dstudent-t distributuion using the sample standard deviation 


Exercise: 


Problem: 


A study was done to determine the proportion of voters that feel that 
their local government is doing an adequate job. Of the 160 voters 
surveyed, 144 feel that their local government is doing an adequate 
job. Calculate the 95% confidence interval for the true proportion of 
voters that feel that their government is doing an adequate job. 


¢ A(0.85, 0.95) 


¢ B0.90 
¢ C(0.80, 1.00) 
¢ D(144, 160) 


Exercise: 


Problem: 


To reduce the error bound in a future study, which of the following 
should be done? 


e Aincrease the sample size n 

¢ Bdecrease the sample size n 

e Cconduct a biased survey 

e Dincrease the confidence level 


Exercise: 


Problem: 


A study was done to determine the average number of homes that a 
homeowner owns in his or her lifetime. For the 60 homeowners 
surveyed, the sample average was 4.2 and the sample standard 
deviation was 2.1. Calculate the 95% confidence interval for the true 
average number of homes that a person owns in his or her lifetime. 


¢ A(3.90, 4.50) 
° B(3.66, 4.74) 
¢ C(4.01, 4.39) 
¢ D(3.67, 4.73) 


Exercise: 


Problem: 


A study was done to determine the average number of homes that a 
homeowner owns in his or her lifetime. For the 60 homeowners 
surveyed, the sample average was 4.2 and the sample standard 
deviation was 2.1. The distribution to use to calculate the 95% 
confidence interval is 


e AExponential 

e BStudent-t with df = 59 
e CStudent-t with df = 60 
e DBinomial 


Exercise: 


Problem: 


Calculate the error bound for a survey in which the sample mean is 5, 
the population standard deviation is 3, the confidence level is 0.90, and 
the number surveyed is 12. 


© A1.56 
¢ B1.42 
© C(3.58, 6.42) 
¢ D(3.44, 6.56) 


Elementary Statistics: Quiz 9: Hypothesis Testing, Single Mean and Single 
Proportion 
Exercise: 


Problem: 


A study is done to see if the average age a "child" moves permanently 
out of his parents' home in the United States is at most 23. 43 U.S. 
Adults were surveyed. The sample average age was 24.2 with a 
standard deviation of 3.7. The p-value is 


e A0.0334 
e B2.13 

e €0.0167 
e DO0.0197 


Exercise: 


Problem: 


A study is done to see if the average age a "child" moves permanently 
out of his parents' home in the United States is at most 23. 43 U.S. 
Adults, all age 40, were surveyed. The sample average age was 24.2 
with a standard deviation of 3.7. The alternate hypothesis is 


e Apopulation mean >= 24.2 
e¢ Bpopulation mean <=23 

e Cpopulation mean > 24.2 
e Dpopulation mean > 23 


Exercise: 


Problem: 


A study is done to see if the average age a "child" moves permanently 
out of his parents' home in the United States is at most 23. 43 U.S. 
Adults, all age 40, were surveyed. The sample average age was 24.2 
with a standard deviation of 3.7. Which is the Type I error? 


e AConclude that the average age is greater than 23, when it is 
24.2. 

e BConclude that the average age is at most 24.2, when it is at most 
24.2. 

e CConclude that the average age is at most 23, when it is greater 
than 23. 

e DConclude that the average age is greater than 23, when it is at 
most 23. 


Exercise: 


Problem: 


A study is done to see if the average age a "child" moves permanently 
out of his parents' home in the United States is at most 23. 43 U.S. 
Adults, all age 40, were surveyed. The sample average age was 24.2 
with a standard deviation of 3.7. Which is the Type II error? 


e AConclude that the average age is greater than 23, when it is at 
most 23. 

¢ B Conclude that the average age is greater than 23, when it is 
24.2. 

e CConclude that the average age is at most 24.2, when it is at most 
24.2. 

e DConclude that the average age is at most 23, when it is greater 
than 23. 


Exercise: 


Problem: 


Consider the statement, "New cars are expected to last an average of 
three years before needing major service done to them." With a p-value 
of 0.0079, we conclude that: 


e ANo conclusion can be made. 
e Bnew cars last an average of less than three years before needing 
major service done to them. 


e Cnew cars last an average of at least three years before needing 
major service done to them. 


Exercise: 


Problem: 


Given the set of hypotheses: Ho: p = 0.4 Ha: p < 0.4. This test is 


e Atwo-tailed 
¢ Bright-tailed 
¢ Cleft-tailed 
e Dno-tailed 


Exercise: 


Problem: 


Given the set of hypotheses: Ho: p = 0.4 Ha: p < 0.4. The probability 
distribution to use for the hypothesis test is the 


e Abinomial 

e Bnormal 

e Cstudent-t 

e Dexponential 


Exercise: 


Problem: 


Given the set of hypotheses: Ho: p = 0.4 Ha: p < 0.4. If the estimated 
proportion is 0.35, then the p-value can be interpreted as 


e Athe probability that the estimated proportion is the same as the 
p-value. 

¢ Bthe probability that the estimated proportion is at least 0.35, 
when the population proportion is, in fact, equal to 0.4. 


e Cthe probability that the population proportion is at most 0.4, 
when the estimated proportion is equal to 0.35. 

¢ Dthe probability that the estimated proportion is at most 0.35 
when the population proportion is, in fact, equal to 0.4. 


Exercise: 


Problem: 


Consider the statement, "New cars are expected to last an average of at 
least three years before needing major service done to them." Which of 
the following is the null hypothesis? 


e AThe population mean <=3. 

e BThe population mean < 3. 

e CThe population mean > 3. 

e DThe population mean is >= 3. 


Exercise: 


Problem: 


Consider the statement, "New cars are expected to last an average of 
three years before needing major service done to them." With a p-value 
of 0.2456, which is the correct decision? 


e ANo decision can be made. 
e BReject the null hypothesis. 
e CDo not reject the null hypothesis. 


Elementary Statistics: Quiz 10: Hypothesis Testing: Two Means, Two 
Proportions, Paired Data 
Exercise: 


Problem: 


Of interest is whether the percent of full-time international college 
students that take over 16 units per term is the same as the percent of 
full-time local college students. Sixty full-time international students 
are surveyed. Forty-six of them take over 16 units per term. One 
hundred forty-nine full-time local students are surveyed. Ninety-eight 
of them take over 16 units per term. The hypothesis test to use is 


e Amatched or paired samples 

e Btwo proportions 

e Cindependent group means, population standard deviations 
known 

e Dindependent group means, population standard deviations 
unknown 


Exercise: 


Problem: 


A study is done to determine if girls lose their first baby tooth earlier, 
on average, than boys. Twenty-five girls and twenty-five boys are 
surveyed. The average ages were 5.8 years for girls with a standard 
deviation of 0.6 years, and 5.9 years for boys with a standard deviation 
of 0.7 years. The hypothesis test to use is 


e Atwo proportions 

e¢ Bmatched or paired samples 

e Cindependent group means, population standard deviations 
known 

e Dindependent group means, population standard deviations 
unknown 


Exercise: 


Problem: 


Fourteen cars from two different models of a car manufacturer are 
compared to see if the smaller model has a higher average MPG than 
the larger model. From years of testing these models, it is known that 
the standard deviation of MPG is 3.4 MPG for the smaller model and 
4.0 MPG for the larger one. From this survey, the sample means and 
standard deviations are 28.2 and 3.0, and 22.6 and 5.1, respectively. If 
we conduct a hypothesis test, the decision and conclusion are: 


e AReject the null hypothesis and conclude that the average MPG 
for the smaller car is higher than for the larger model. 

¢ BDo not reject the null hypothesis and conclude that the average 
MPGs are the same. 

e CReject the null hypothesis and conclude that the average MPGs 
are different. 

e DDo not reject the null hypothesis and conclude that the average 
MPG for the smaller car is higher than for the larger model. 


Exercise: 


Problem: 


Eighty-four people participate in a coordination study. They are each 
timed for how long they can jump rope without a mistake with their 
eyes open. Then, they each repeat the task with their eyes closed. For 
each individual, the difference in times is calculated. The purpose of 
the study is to determine if the lack of vision in normally seeing 
individuals affects coordination. The hypothesis test to use is 


e Atwo proportions 

e Bmatched or paired samples 

e Cindependent group means, population standard deviations 
unknown 


¢ Dindependent group means, population standard deviations 
known 


Exercise: 


Problem: 


Twelve pairs of identical twins that were separated at birth are reunited 
at age 21. Each twin is weighed. Of interest is if identical twins raised 
separately will weigh within 10 pounds of each other as young adults. 
The sample average difference was 12.1 pounds with a standard 
deviation of 3.4 pounds. If we conduct a hypothesis test, the 
distribution to use is 


e ANormal 

e BBinomial 

e CStudent-t with df = 24 
e DStudent-t with df = 11 


Exercise: 


Problem: 


Forty cars from two different models of a car manufacturer are 
compared to see if the smaller model has a higher average MPG than 
the larger model. From years of testing these models, it is known that 
the standard deviation of MPG is 3.4 MPG for the smaller model and 
3.6 MPG for the larger one. From this survey, the sample means and 
standard deviations are 28.2 and 3.5, and 22.6 and 3.7, respectively. 
The hypothesis test to use is 


e Atwo proportions 

¢ Bindependent group means, population standard deviations 
known 

e Cmatched or paired samples 

e Dindependent group means, population standard deviations 
unknown 


Exercise: 


Problem: 


Thirty children and forty-five adults were surveyed to determine if 
children watch, on average, the same amount of television as adults per 
day. The averages from the study were 3.4 and 2.7 hours per day, 
respectively. The standard deviations from the study were 2.3 and 1.4 
hours per day, respectively. The hypothesis test to use is 


Amatched or paired samples 

Btwo proportions 

Cindependent group means, population standard deviations 
unknown 

Dindependent group means, population standard deviations 
known 


Exercise: 


Problem: 


Twelve pairs of identical twins that were separated at birth are reunited 
at age 21. Each twin is weighed. Of interest is if identical twins raised 
separately will weigh within 10 pounds of each other as young adults. 
The hypothesis test to use is 


Aindependent group means, population standard deviations 
unknown 

Bindependent group means, population standard deviations 
known 

Cmatched or paired samples 

Dtwo proportions 


Exercise: 


Problem: 


One hundred men and one hundred women are surveyed to determine 
if the percent of women that lie about their weight is the same as the 
percent of men, or if the percent of women is higher than the percent 
of men. The hypothesis test to use is 


e Aindependent group means, population standard deviations 
known 

e Btwo proportions 

e Cindependent group means, population standard deviations 
unknown 

e Dmatched or paired samples 


Exercise: 


Problem: 


Fourteen cars from two different models of a car manufacturer are 
compared to see if the smaller model has a higher average MPG than 
the larger model. From years of testing these models, it is known that 
the standard deviation of MPG is 3.4 MPG for the smaller model and 
4.0 MPG for the larger one. From this survey, the sample means and 
standard deviations are 28.2 and 3.0, and 22.6 and 5.1, respectively. If 
we conduct a hypothesis test, the p-value is 


e A0.0004 
e BO.0002 
e €0.0000 
e DO0.0010 


Elementary Statistics: Quiz 11: The Chi-Square Distribution 
Exercise: 


Problem: 
As the degrees of freedom increase (and especially when the degrees 


of freedom are more than 90), the graph of the chi-square distribution 
looks more and more 


e Asymmetrical 
e¢ Bskewed right 
e¢ Cskewed left 
e¢ Dasymmetrical 


Exercise: 


Problem: 
The expected values are what we have from 


e Adata 

e Bsampling 

e Ctheory 

e Dexperiments 


Exercise: 
Problem: 


Which test is the correct one to use when determining if the gender of 
a person is independent of the college major of the person? 


e AGoodness-of-Fit 
e Best for Independence 


Exercise: 


Problem: 
The Goodness-of-Fit hypothesis test is typically a 


e Atwo-tailed test 

¢ Bwagging-tailed test 
e Cleft-tailed test 

e Dright-tailed test 


Exercise: 
Problem: 


What is the relationship between the mean and the standard deviation 
of the chi-square distribution? 


e AThey are the same. 

e BThey are inverses of each other. 

e CThe standard deviation is twice the mean. 

e DThe standard deviation is the square root of 2 times the mean. 


Exercise: 


Problem: 
The Chi-Square Test for Independence is typically a 


e Aleft-tailed test 
e Bright-tailed test 
e Ctwo-tailed 

e Dwagging-tailed 


Exercise: 
Problem: 


Which test is the correct one to use when determining if a class 
distribution of grades follows the normal distribution? 


e AGoodness-of-Fit 
e BTest for Independence 


Exercise: 
Problem: 


Which test is the correct one to use when determining if the numbers 
picked in the lottery are randomly selected? 


e AGoodness-of-Fit 
e BTest for Independence 


Exercise: 


Problem: 


Which test is the correct one to use when determining if the religion of 
a person is related to his/her political party affiliation? 


e AGoodness-of-Fit 
e BTest for Independence 


Exercise: 


Problem: 


An eight-sided die is rolled 160 times. The number of 1’s, 2’s, 3’s, 4’s, 
5’s, 6’s, 7’s & 8’s obtained are 24, 36, 15, 17, 9, 22, 18 & 19, 
respectively. We are testing to see if the die is fair. What is the test 
statistic for this problem? 


e Al2.7 
e B21.8 
e C9.4 

e D46.3 


Elementary Statistics: Quiz 12: Linear Regression and Correlation 
Exercise: 


Problem: 
If there is a linear relationship between the quantity of food a person 


consumes per day and the person’s weight, the quantity of food a 
person consumes per day is the 


e Aindependent variable 
e BUnable to determine 
e Cdependent variable 


Exercise: 


Problem: 


A correlation coefficient for 9 pairs of data is calculated to be -0.213. 
Is the relationship between the variables statistically significant? 


e AUnable to determine 
e BNo 
e CYes 


Exercise: 


Problem: 

Consider the following pairs of data (independent variable given first): 
(17, 56) (13, 48) (35, 65) (89, 146) (55, 121) (86, 130) 

Calculate the Least Squares Line. 


e Ay-hat = -20.93 + 0.73x 
e By-hat = 33.33 + 1.24x 
e Cy-hat = 1.24 + 33.3x 

e Dy-hat = 0.73 - 20.93x 


Exercise: 


Problem: 
A correlation coefficient for 6 pairs of data is calculated to be 0. Then, 


e Athere is positive correlation between the variables. 
e Bthe Line of Best Fit has a negative slope. 

e Cthere is negative correlation between the variables. 
e Dthe Line of Best Fit has a slope of zero. 


Exercise: 
Problem: 
A study is done to see if there is a linear relationship between the life 


expectancy of an individual and the year of birth. The year of birth is 
the 


e AUnable to determine 
e Bdependent variable 
e Cindependent variable 


Exercise: 


Problem: If you determine a point is an outlier, you would 


e Aignore it. 

¢ Balways delete it and then recalculate the line of best fit. 

e Cexamine it carefully and try to determine what is causing it to be 
an outlier. 

e Duse it in the calculation of the Sum of Squared Errors. 


Exercise: 


Problem: 


Consider the following pairs of data (independent variable given first): 


(17, 56) (13, 48) (35, 65) (89, 146) (55, 121) (86, 130) 
Find the estimated y for an x-value of 40. 


e A-836 
e B1334 
e C83 
e D8.3 


Exercise: 


Problem: 


A study is done to see if there is a linear relationship between the 
calendar year and the population of a particular country. The calendar 
year is the 


e Adependent variable 
e BUnable to determine 
e Cindependent variable 


Exercise: 
Problem: 
Consider the following pairs of data (independent variable given first): 
(16, 56) (10, 98) (35, 105) (4, 70) (12, 121) 
The relationship between the two variables is 
e Alinear 
e Bnot significant 


e Cparabolic 
e Dexponential 


Exercise: 


Problem: 
Consider the following pairs of data (independent variable given first): 
(16, 56) (10, 98) (35, 105) (4, 70) (12, 121) 
The regression line is 
e Asteeply sloped downward 


¢ Balmost horizontal 
e Csteeply sloped upward 


Elementary Statistics: Calculator Instructions and Resources 


Graphing Calculator Instructions 


e TI-83 video tutorials are available here: (Part 1, Part 2). 

e The Discrete Mean & Stdev link shows how to calculate a mean and 
stdev for a discrete random variable. 

e The Outliers link shows how to calculate outliers using the lists. 

e The Matrix: TI-89 link shows how to create a matrix using the TI-89 
calculator for use in a Test of Independence. 

e The Random Numbers link shows how to generate random numbers. 

e The Sorting link shows how to sort a list. 

¢ The TI-89 link gives some general instructions for the TI-89 plus 
linear regression instructions. 

e Link to Graphing Calculator Help for Several TI Calculators 


Calculating the Mean and Standard Deviation of a Discrete 
Probability Distribution on the TI-83 and TI-86 


1. Enter the possible values for your random variable X in list L1. 

2. Enter the probabilities for each value of X in list L2 in the position 
next to the value of X. 

3. Calculate “One Variable Statistics” for lists L1 and L2. 

4. (Do this the same way you calculate one variable statistics for data 
values and frequencies when given a set of data.) 


5. The calculator will give the mean as "x-bar" ¢ although we know that 
this mean is actually p because it is the mean of a probability 
distribution. 

6. The calculator will give the standard deviation as 0 which indicates the 
standard deviation of a probability distribution (as well as the standard 
deviation of a population). 

7. The calculator does not give you a value for s because the frequencies 
you gave it are not whole numbers. 


Outlier Instructions for the TI-83, 86, and 89 Calculators 


When you finish going over these instructions, do TEXT problem #5 
("stories" and "height of building") in Ch. 12. One of the points is an 
outlier. 


This explains how to find outliers on various calculators. Suppose the data 
is (3,5), (6,8), (9,7), (5,20). The xlist is 3, 6, 9,5 and the ylist is 5, 8, 
7, 20. Put the X1list into L1 and the ylist into L2. Do the linear 
regression. yhat = 11.2267 - .2133x 


Note: TI-89: Use Flashapps to enter your lists and then find your 
instructions below the instructions for the TI-83 & 86. 


For TI-83 & 86 Calculators 


1. Go back to where you entered the lists L1 and L2 and go to the list 
name L3. Enter 11.2267 - .2133L1 (You enter the equation at the 
bottom of the screen where it says L3. Press Enter. In L3 are the 
yhat values. 

2. Arrow to the list name L4. Enter L2-L3. Press Enter. (In L4 are the 
y - yhat values.) 

3. Arrow to the list name L5. Enter L442. Press Enter. (In L5 are the 
(y - yhat)2 values.) 


. Continue by finding the Instructions for the appropriate calculator 


below (83 or 86): 


For TI-83 


1. 


& 


Exit to the Home Screen. Clear it. Press 2nd LIST. Arrow to MATH. 
Press 5: Sum. (Press L5). Press Enter. You should see 137.1467 (to 
4 decimal places). This is the SSE. 


. Calculate s. Press the square root symbol and enter (137.1467/2). 


(You get the denominator by taking the number of data points and 
subtracting 2: 4 - 2 = 2.) Press Enter. You should see 8.2809 (to 4 
decimal places). 


. Multiply 8.2809 by 1.9. You should see 15.7337. 
. Press L4. Press Enter. (Use the arrow keys to scroll through the list.) 
. Compare 15.7337 to the absolute values of the numbers in L4. If any 


absolute value is greater than or equal to 15.7337, then the 
corresponding point is an outlier. 


. Absolute values of the numbers in L4 are 5.587, 1.947, 2.307, 9.8398. 


None of them are greater than 15.7337, so no point is an outlier. 


For TI-86 


Ih, 
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Exit to the Home Screen. Clear it. Press 2nd LIST. Press F5 
(OPS). Press MORE. Press F1 (Sum). Press 2nd F3 (NAMES). 
Find the F key with L5 and press it. Press Enter. You should see 
137.1467 (to 4 decimal places). This is the SSE. 


. Calculate s. Press the square root symbol and enter (137.1467/2). 


(You get the denominator by taking the number of data points and 
subtracting 2: 4 - 2 = 2.) Press Enter. You should see 8.2809 (to 4 
decimal places). 


. Multiply 8.2809 by 1.9. You should see 15.7337. 
. Press L4. Press Enter. (Use the arrow keys to scroll through L4.) 
. Compare 15.7337 to the absolute values of the numbers in L4. If any 


absolute value is greater than or equal to 15.7337, then the 


corresponding point is an outlier. 
6. Absolute values of the numbers in L4 are 5.587, 1.947, 2.307, 9.8398. 
None of them are greater than 15.7337 so no point is an outlier. 


For TI-89 


1. NOTE: When you do Linear Regression, a list called "reSid" is 
created automatically. This list has the y - yhat values in it. 

2. Go into FLlashapps and into your lists. Arrow to "resid" so you 
see it. Then arrow to the list name L3. Press F3. Press 1: names. 
Arrow down to STATVARS. If it is not checked, press F4. Then arrow 
down to resid. Press Enter. Press 42. L3 will have the (y - 
yhat ) 42 values. Press HOME and Press F1 8 to clear it. Press 2nd 
MATH. Press 3:List. Press 6: Sum. Press alpha L3. Depending on 
how you have MODE Display Digits set, you should see 
approximately 137.1467. This is the SSE. 

3. Calculate s. (You should be HOME.) Press clear. Press the square 
root symbol and enter 137 . 1467/2). (You get the denominator by 
taking the number of data points and subtracting 2: 4 - 2 = 2.) Press 
Enter. You should see 8.2809 (to 4 decimal places). 

4. Press the times key and enter 1.9. Press Enter. You should see 
15,7337 

5. Press Clear. Press 2nd VAR-LINK. Arrow down to resid (It is 
below STAT VARS). Press Enter. Arrow up to the list. Scroll 
through the list using the arrow keys. 

6. Compare 15.7337 to the absolute values of the numbers in the list. If 
any absolute value is greater than or equal to 15.7337, then the 
corresponding point is an outlier. 

7. Absolute values of the numbers in the list are approximately 5.59, 
1.95, 2.31, 9.84. None of them are greater than 15.7337 so no point is 
an outlier. 


Create a Matrix to use in a Test of Independence on the TI-89 


1. Press APPS 6:Data/Matrix Editor 


. Press 3: New 

. Arrow over and down to 2:Matrix. 

. Press Enter 

. Arrow down to Folder. Either use the one that is there or arrow over 
and down to another folder name (don't use Statvars) and press 

Enter. 

6. Arrow down to Variable and enter a name you will remember. 

7. Arrow down to ROw dimension and enter the number of rows you 
want. 

8. Arrow down to Column dimension and enter the number of 
columns you want. (NOTE: You can change these numbers if you 
want for a different problem.) 

9. Press Enter until you see your matrix with zeroes as the entries. Fill 
in your matrix with the data from the table. 

10. Press APPS 

11. Press 1: Flashapps 

12. Press Enter 

13. Press Bagot 

14. Press 8:Chi2 2-way 

15. Enter the name of your matrix atObserved Mat: 

16. Press Enter. 

17. You should see the screen with the test statistic and the p-value. 
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¢ To change the size of your matrix, press APPS 6:Matrix/Data 
Editor 2:Open. 

e Then fill in the OPEN screen with Matrix, the correct folder, and the 
correct name of your matrix. 

e Press Enter until you see your matrix. 

e To resize it, press F6 Util 6:Resize Matrix. Enter the row 
dimension and arrow down to column dimension and enter that 
number. Press Enter until you see your resized matrix. 


Generating Random Numbers with the TI Calculators 


The following functions will generate random numbers from the list of 
numbers 1 - 60. 


TI-83 


1. Press MATH. Arrow over to PRB. 

2. Press 5: randIint. Enter (1,60). 

3. Press Enter and you will see the first random number. 

4. Keep pressing Enter to get random numbers between 1 and 60, 
inclusive. 


TI-86 


1. Press 2nd MATH. 

2. Press PROB (in F2). 

3. Press randint (in F5). Enter (1, 60). 

4. Press Enter and you will see the first random number. 
5. Press Enter and you will see the first random number. 


TI-89 


e Press 2nd MATH. 

e Press 7:Probability. 

e Press 4: rand. Enter (1, 60)and press Enter. 

e You will see the first random number. 

e Keep pressing Enter to get random numbers between 1 and 60, 
inclusive. 


How to Sort Data Using the TI Calculators 


Only sort data where the frequencies are all 1. 


TI-83 


1. Enter the data into a list L1, L2, L3, L4, L5, or L6. 


2. Press 2nd LIST (above the STAT key). 

3. Arrow over (use the right arrow) to OPS. 

4. Press 1 (for 1: SORTA). 

5. Enter the name of your list and a right parenthesis. Press Enter. 
6. Press STAT 1 and look at your list. It should be in sorted order. 


TI-86 


. Enter the data into a list. 

. Press a. 

. Press 2nd LIST (above — key). 

. Press OPS (in F5). Press SortA (in F2). 

. Enter the list name by pressing 2nd NAMES (in M3). 

. Arrow to your list and press the appropriate F key. 

. Press the STO-> key. 

. Enter the same list name. Press Enter. 

. Check that your list is sorted by going back into 2nd STAT EDIT. 
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TI-89 


Note:For the TI-89, get Flashapps from the TI Web site or have your 
instructor put it into your calculator. 


1. Press APPS 1 (for1:Flashapps). Press Enter. 

2. Press Enter again. Enter your data into a list (Enter data into one of 
the L lists, L1, L2, L3, L4). 

3. Press F3 (for List). Press 2 (for 2: Ops). 

4. Press 1 (for1:Sort List). If you see your list name, press Enter. 
If the correct list name is not there, then enter it. Press Enter. 

5. You should see your list in sorted order. 


Notes for the TI-89 Calculator 


In most TI-89 calculators, you must load a new operating system first and 
then the APPS statistics program. You can either go to your instructor OR 
you can go to the TI-89 site to get the operating system and the APPS 
Statistics program. You must go to the TI-89 site to get the Guidebook. It 
has all the instructions for running the statistics program. 


To go to the TI-89 site, click http://education.ti.com to get to the TI-89 link. 
Then choose "apps" and "Handheld Software Application." From this page, 
you can download the operating system (version 2.05) and statistics 
program (Statistics with List Editor - this page will have both the operating 
system and the program). 


You must then load the Advanced Mathematics Software Operating System 
and the Statistics with List Editor from your computer to the TI-89 
calculator (unless your instructor has loaded them directly into your 
calculator). For that, you will need the TI-GRAPH LINK™ software and 
the TI-GRAPH LINK™ Cable. TI sells them on their Web Site. 


Getting Started 
Create the folder “mystat” (for statistics): 


1. Press 2nd VAR-LINK. Press F1 Manage. 
2. Press 5:Create Folder. Enter the name “mystat” and press 
Enter twice. 


Make mystat your current folder: 


1. Press MODE. Arrow down to Current Folder and arrow over and 
down to mystat. 

2. Press Enter twice. You should see the name MYSTAT in the lower 
left corner. 


Creating list names L1, L2, and L3: 


1. Press APPS 1:Flashapps. Press Enter. 

2. Arrow up into the name area of the lists and over until you reach a 
blank title area. 

3. Press L (above the 4). Press alpha. Press 1. Press Enter. 

4. Arrow up into the name area of the lists and over until you reach a 
blank title area. 

5. Press L. Press alpha. Press 2. Press Enter. 

6. Arrow up into the name area of the lists and over until you reach a 
blank title area. 

7. Press L. Press alpha. Press 3. Press Enter. 


Linear Regression 


Given the data set: (3, 5), (6, 8), (9,7), (5, 20). The x List is 3, 6, 9, 5 and 


the ylist is 5, 8, 7, 20. Put the xList into L1 and the ylist into L2. 


Constructing a Scatter Plot 


1. Make sure you are using Flashapps (Press APPS 1:Flashapps. 
Press Enter.) 


2. SCATTERPLOT: After you are have entered your lists in L1 and L2, 


press F2. 

3. Press 1:Plot Setup. Highlight Plot 1 and make sure there are 
no checks next to any other plots. If there are, arrow to the plot and 
press F4. 

4. Then arrow back to Plot 1. 

5. Press F1: Define. For Plot Type, press the right arrow and 
press L: Scatter. 

6. Arrow down, press the right arrow and press 1: BOX. 

7. Arrow down to X. Press alpha L1. Arrow down to y and press 
alpha L2. 


8. Arrow down to USe Freq and Categories? and use the right 


arrow. Highlight NO and press Enter. 


9. Press Enter again. 
10. Press F5 Zoomdata. You should see the scatterplot. Press F3 to 
trace and the arrow keys to see the coordinates of the points. 


Calculating the Regression Equation 


. Press APPS, 1: Flashapps, and Enter. 

. Press F4:Calc. 

. Press 3: Regressions. 

. Press 1:LinReg (ax+b). For x List, enteralpha L1. Arrow 

down. For y List, enteralpha L2. 

5. For Store RegEqn to:, arrow right and arrow down to y1( x) 
(or any one of the y's) and press Enter. 

6. Press Enter. You should see a screen with a, b, r42, and r on it. 
The regression is complete. 

7. Write down the equation from the information and press Enter. 

8. The linear regression is yhat = 11.2267 - .2133x 
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Drawing the Regression Line 


1. Press the key with the green diamond on it (it is below the 2nd key) 
and press Y= (above the F1 key). 

2. This is the Line of Best Fit. 

3. Press the key with the green diamond on it and press GRAPH (above 
the F3 key). The line will be drawn. 


Outliers 


1. When you do Linear Regression, a list called "reSid" is created 
automatically. 

2. This list has the “y - yhat” values in it. 

3. Go into FLlashapps and into your lists. 


Ol 


20), 


21 


. Arrow to "resid" so you see it. 
. Then, arrow to the list name L3. Press F3. 
. Press 1: names. Arrow down to STATVARS. If it is not checked, 


press F4. 


. Then, arrow down to resid. Press Enter. Press “2. Press Enter. 


L3 will have the (y - yhat)/2 values. 


. Press HOME and Press F1. 8 to clear it. 
. Press 2nd MATH. Press 3:List. Press 6: Sum. Press alpha 


(L3). 


. Depending on how you have MODE Display Digits set, you 


should see approximately 137.1467. This is the SSE. 


. Calculate s. (You should be HOME.) Press clear. 
. Press the square root symbol and enter 137 .1467/2. (You get the 


denominator by taking the number of data points and subtracting 2: 
4-2=2.) 


. Press Enter. You should see 8.2809 (to 4 decimal places). 
. Press the times key and enter 1.9. Press Enter. You should see 


S27 san. 


. Press Clear. Press 2nd VAR-LINK. 
. Arrow down to resid (it is below STAT VARS). Press Enter. 


Press Enter again. 


. Arrow up to the list. Scroll through the list using the arrow keys. 
. Compare 15.7337 to the absolute values of the numbers in the list. If 


any absolute value is greater than or equal to 15.7337, then the 
corresponding point is an outlier. 


. Absolute values of the numbers in the list are approximately 5.59, 


1.95, 2.31, 9.84. 
None of them are greater than or equal to 15.7337. 


. Therefore, for this data set, no point is an outlier. 


